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1EJXRQDL1CTI0K . 

A central problem in the mathematical theory of 
computers and computation is to find a suitable framework 
for expressing the execution of a computer program by a 
computer. Within the framework wc want to be able to 
provide answers to such questions as; 

(1) Does a certain program perform a certain task? 

(2) Are two programs equivalent, i.e., do they 
perform the same task? 

(3) Under what conditions, if at all, will 
a program fail to halt? 

(4) How can a given program be simplified, 
in some sense, or cade more efficient? 

These kinds of questions are customarily answered by 

experienced intuition, for simple programs, supplemented 

by trial and, often, error for more complicated ones. We 

should like to replace such methods by a formalizable 

procedure, capable of being carried out by a computer 

program* Unfortunately, under most definitions, all the 

above questions are provably unsolvable. The proof 

might proceed by saying that if we could answer any of 

these questions, we could certainly establish of a given 

program whether or not it ever halts. Then, if our 

programming language is sufficiently powerful to carry out 



real computations, wo should be able to reduce the 
halting problem for Turing machines, or some such 
unsolvable decision problem, co the halting problen for 
certain programs in our language. 

In default of a complete procedure, it may, 
nevertheless, be worthwhile co look for various techniques 
by which to prove, say, that two programs are equivalent. 
J. McCarthy [9,10] introduces a formalism using conditional 
expressions, in which new functions arc produced from old 
functions by a recursive definition process. A computer 
program in his formalism, given in the form of a flow 
diagram, in which the flow of calculation Is controlled 
by conditional egression*, can be converted to a series 
of function definitions. Various techniques are presented 
whereby such recursive functions, and hence programs, may 
be proved equivalent. 

Cooper [1] generalises certain of these equivalence 
proofs to programs Involving undefined functions, satisfy- 
ing certain conditions and relations. These techniques 
seem to be applicable only to programs of a relatively 
simple nature and furthermore, for any actual programming 
language and computer, the intrinsic relations between 
the operations expressible in the language are often 
complicated- Even apparently simple relations may have 
to be qualified by exceptions and adjustments corresponding 



to the Idiosyncrasies of the language and computer* As 
a fairly trivial example, algebraically: 

(x+l)<x-l) - x 2 -l 

but the corresponding results of the two 'routines': 

Y :- X-l 

v *. vii j X :» XxX 

X :* X+l and 

X := X-l 

X :- X*Y 

as calculated by a conrputer will rarely be the same, and 
if X initially has a value close to unity the first 
expression ia likely to give the more precise result. 
We cannot in general allow two such routines to be 

treated as equivalent. 

Cooper stresses the need to be able to prove 

'shallow results about large programs', by which he 
means results which depend only on rudimentary properties 
of the language and the sinple relations between the 
basic functions; for example, we might consider the 
purely graph-theoretic transformations which can be made 
on Che flow-diagram of a program. 

lu. lanov [6], whose work is reported also by 
Rutledge [15], accomplishes this by considering the 

'logical schemes 1 representing the sequential and control 
properties of programs which remain when almost all the 



Information about the nature of the basic operations is 
disregarded. He la able to obtain a complete decision 
procedure for the equivalence of schemes, but sacrifices 
a great deal of the essential structure of programs, 
leaving, in effect, little more than finite automata. 

The work described here is an investigation 
into a model of computation which is, in a sense, an 
extension of lanov's model. We regard our 'computer 1 
as having storage divided up into a finite number of 
discrete 'locations', with each of which is associated a 
value from soae fixed, usually infinite, domain. A 
typical computation statement or instruction: 

L 2 :- F(L 2 ,L 3 ) 

has the effect of assigning a new value to the location 
L«, which depends in some fixed but unspecific way on 
the previous values of L ? and L* and the symbol F. The 
only other kind of statement performs some test T similarly 
unspecified > on the value of a location (without changing 
this value), and decides, on the basis of this test, 
which statement is to be executed next. 

We are free to regard a statement as ■ 
'machine-code 1 order for some computer, as an assignment 
or conditional jump statement in some autocode or 
ALGOL-like language, as a sub-routine of a program (in 



which case a location may be a whole block of storage 
or some more general data structure), or even as some 
step in a process quite unconnected with digital comput- 
ation- We shall usually adopt the terminology and motiva- 
tion of the second of these. 

A program, Bade up from such statements, is called 
a 'program schema', because as yet the function 
and tect symbols which appear have no corresponding 
semantic content, that is to say* no arithmetical or 
logical functions are associated with them. An 
'interpretation 1 of the language is an assignment of 
appropriate functions to thesfi symbols and of initial 
values to the locations. An interpreted program schema 
is then, in effect, a computer program which could be 
executed on some Idealized computer. We can define two 
program schemata to be 'equivalent' if, under all 
interpretations, the two programs produce identical 
results. Of course certain interpretations will be 
quite unsuitable in a computational contest, and we 
can introduce various restrictions on the kind of 
interpretation, considering perhaps recursive interpreta- 
tions, or interpretations ov«r finite domains of values. 

We should point out at this stage some of the 
shortcomings of our formalism, which ariss from the 



wholesale reraovaL of explicitly defined functions and 
our consequent inability to express the self-rodificatlon 
of computer programs. We have no way of representing the 
recursive use of sub-routines or the indirect addressing 
of storage registers* and we must treat any vector or 
matrix as a single location rather than as an array of 
distinct elements. An extension of our language to 
include some such features would certainly be desirable, 
but this is beyound the scope of this dissertation. 

The model of computation which program schemata 
provide, allows us to study the characteristic properties 
of t at least, some kind of computational process without 
the interference, and dominance, of the arithmetical and 
logical properties of the basic operations involved. We 
envisage a practical procedure for the autocaatic simplifi- 
cation of computer programs, which would consist of 
abstracting the corresponding schemata, applying to these 
some schemata-simplifying techniques which would be quite 
independent of the computer or language concerned , and 
finally translating back into the original language. In 
finding simplification algorithms for schemata, however, 
ve have very soon to face the problems of deciding when 
two schemata are equivalent. 



PART I. DEFINITIONS AMD ELEMENTARY RESULTS . 
SI. NOTATION AND PREREQUISITES . 

We shall assume the familiarity of the reader with 
some of the elementary results of mathematical logic 
relating to computability and solvability, which appear, 
for example, In Davis [2] or Hermes [4]. We assume 
known a formal definition of algorithm or effective 
procedure . A decision problem enquires as to the truth 
or falsity of each of a vhole class of statements, and 
such a problem is (recursively) unsolvable if there is 
no algorithm which supplies all the answers. For 
solvable decision problems, wc will refrain from giving 

formal algorithms for the solutions and rely on intuition 
to verify that the informal procedures we describe could 
be replaced by such. We say that a decision problem is 
partially solvable if thsra is an effective enumeration 
of the true statements in the class, or equivalent ly t 
if there Is an algorithm which, when presented with a 
statement of the class which is true, comes to the 
correct decision, but otherwise fails to terminate. 
We often wish to use results from the theory of 
Turing machines. The notation we adopt is close to 
that of Davis [2], Wa give here our main definitions 
and state the theorems we shall require. 



Definitions. 



A Turing tsia chine M, is a (2m x 4)-matrix f (m > 0) , 



of the fora; 




1 




Si ° 



b 2n-2 q 2m-2' 



^n-l q 2tn-l' 



% q 2 B ' 



where q,,...,a are distinct integers and: 

• »<L_) for i D l,...,2m. 



V E <<r 



The q. are states and q. 1b the initial state, of M. 
Each b. is one of ( 0,1,L,R,H h Ve shall sooetiaes 
denote 0,1 by fi-^a. respectively. 

A configuration ft, ( 'instantaneous description' in 
[2J) Is a string of symbols of the form: 

where P t Q are (possibly empty) strings of 0*s and l's, 
] ■ or 1, and c is a special symbol. 



o is an initial conf lfturation if i = 1, and a is a 

terminal configuration if ' q a H q ' Is a row of M, 

for some q» 

The consecutive configuration too is: 

(1) OPq^QOt if a is terminal {especial note I) 

«i) OPq^QOc if ' q i aj a |c q r ' is a row of M, 

(ili) OPa^QOc if ' q ± a R q r ' is a row of M, 

(iv) ORq r a k a 3 QOc if ' q ± a. L q y ' ia a row of M 

and P * Ra Rt 

(iv)' q r Oa Q0« if F q t a I q^ ■ la a row of H 

and P is null. 

A computation C, is a sequence of consecutive configurations 
Starting from an initial configuration. The th 
configuration of C is the initial configuration and the 
n+1 is the consecutive configuration to the n , for n >. D. 
C halts if there is a terminal configuration in C, and 
diverges otherwise. 

C halts on Q i f C halts and there is a terminal configu- 
ration of the form: 

PqjOQc 
and similarly for ' C halts on 1 r . 
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Consider the computation C corresponding to 
machine M end initial configuration: 



ll....l qi 0c 



which may be described as ' starting M on n ' . We shall 
say that: 

C[H f n] = if C halts on 0, 
- 1 if C halts on 1, 
and C[M,n] diverges otherwise* 

The symbol will be thought of as a 'blank 1 symbol , and 
the configuration: 

q-Qc 
will be described as ■ starting on a blank tape r . 

We now state, without proof, the few theorems 
which we shall require. Each is readily derivable from 
well-known results which appear as main theorems in any 
standard work on the theory of Turing machines. (Davis [2] 
Hermes [4J0 

Theorem A. The decision problea of whether C[K,01 

diverges for arbitrary M, is recursively unsolvable, 

and is not indeed even partially solvable. 
[Homes* p. 16A] 
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Let H. ,K^t . . be an effective enumeration of the 
set of all Turing machines. 

Theorem B . There is a Turing machine I), with the 
property that, for all n: 

either C(U,n] - C[M ,nj 
or C(U,n] and C[M ,nj both diverge. 
[From Hermes, §30] 
We will call such a U a universal Turing machine . 

— cortfra — ' There Is a Turing machine U, such that it 
is recursively unsolvable whether or not, C[U,n] diverges 
for arbitrary n. 
[Froa Thcorea ft and Heroes, p,14A] 

Theorca D , Given any recursive function f, there is 
a Turing machine M, such that, for all n >_ 0, 
C(H,nl - 1 if f(n) - 

» o of f <d> y 

[Hemes, Section 16] 
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j 2- 'PROGRAM SCHEMATA' AND SOME BASIC RESULTS, 

£ 2.0 In this section, we give the definitions of 

'program schemata" and 'interpretation 1 , of '(strong) 
equivalence 1 and several other relations between schemata, 
and justify our choice of some of these. We also consider 
alternative formulations of these relations which do not 
involve the concept of an 'interpretation*. We Introduce 
the idea of a 'reasonable' relation between schemata, 
which is any relation Intermediate to strong and weak 
equivalence, and finally, we formalize as a 'rul*-book f , 
the kind of algorithm which we would expect to be useful 
in simplifying schemata. These two concepts are due to 
D.M. Park. 



S 2.1 Definitions . 

The formal language we will use contains the following 
symbols: 

(1) intcgor3, 
(11) F , F^, F,»- - - t Fj,F„.. . (operator or 

function symbols) » 
(ill) T i( T ?> * ■ *» T k (transfer or test symbols) 

(iv) L.,L 2 ,.,,,L (location symbols) , 

(v) :■ (the assignment symbol), 

(vl) brackets , cocnas, and other auxiliary symbols* 

The number of symbols in the language is finite, but 

we shall not usually be concerned with the precise 
number. 

The statements (or Instructions) of the language 
are of two types: 

(1) Operator instructions , (or computation 
instructions .) 

and (2) Transfer instructions , (or tests .) 

a. T u ( Lj ) b.c 

where l p b,c v j»lC2 k n » t » n are integers. In both 
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cases, a is the prefi x or address of the Instruction. 
Id (1), t is the assignment location and W ■•••» L. 

are the retrieval locations. In (2), b and c are 

the left- and right- transfer addresses respectively. 

A program scheoa is a finite sequence of 

instructions together with three integers , the Initial 

address which is the prefix of sotne instruction, and 

the left* and right- tcnninal addresses which are not 

prefixes and are distinct t such that: 

{1) the prefix of each instruction is its 
position in the sequence, 

<ii) each transfer address is cither the prefix 
of some instruction or else a terminal 
address, 

(ill) either the last instruction in the sequence 
is a transfer instruction or else its prefix 
is one less than a terminal address. 

We define a sequence t. through a gcheaa as a (finite 

or infinite) sequence of integers satisfying the 

following conditions (where *(l) denotes the i element 

of t); 

(1) t(1) is the initial address of the schema, 

(2) if u{i) is the prefix of an operator 
instruction then; 
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n(i+l) ■ n(I) + 1 

(3) if *(I) is the prefix of a transfer instruction 
vith transfer addresses b,c» then ; 

*(i+l) is either b or c 

(4) if v(i) is a terminal address, it is the last 
element of n , 

The value of a finite sequence through a schema is 
defined to be or 1 according as the final element 
of it is the left- or right- terminal address. The 
value of an infinite sequence is undefined. 

If the operator symbols and transfer symbols 
of a schema are interpreted as standing for particular 
functions and characteristic functions over a suitable 
domain, then the schema can be regarded as a computer 
program which could be executed by some idealized 
computer. A computation starts at the initial instruction 
of the schema with a specified initial value from the 
domain, associated with each location symbol* An 
operator Instruction assigns s new value to the 
assignment location and the succeeding instruction 
is executed next. A transfer instruction leaves the 
values of all the locations unchanged but applies its 
characteristic function to the current v$lue of its 
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location and, according to whether the resulting value 
is or 1, the instruction with the left- or right- 
transfer address respectively, as prefix is executed 
next. 

More formally, an interpretation I, Is a mapping 
from the location, operator and transfer symbols into a 
set D, and the sec of functions and characteristic 
functions on D, such that: 

(I) to each L is assigned some element I(L f ) c D» 

(II) to each F is assigned some n-adic function 

I(Fj) : D n - D, 

(iii) to each T is assigned some characteristic 
function 1(T ) : D * {0,1}. 



u 



The execution sequence it-, and computation sequence A~(n~) 

corresponding to a given interpretation I, are defined 

u follows, where AjO^HOCj) is the j ch element of this: 

(1) Hjtt) U the initial address t 

(2) A^iij) <1) - <I(L 1 ),...,ia o )> 

(3) if Wjd) is the prefix of the instruction: 
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then: 



ijCi+l) - "jd) + 1 



AjC »j) Ci+1) (j) - Aj( Bl ) (i) (j) If J / v. and 

AjC nj) (1+1) (v) -UfJ) [AjC ij) (i) (kj) AjC Tj) (i) (k t )] 



(4) If Bjd) Is the prefix of the test: 

W b - c 



then: 



i,<i+l) * b if MT^lAjOTjMiHvM - 



- c if " ■ - l 

and 

For any scheoa P and interpretation I, we define 
val(P -) as the value of the corresponding execution 

sequence tt if this is defined- In this case we shall 

say that P converges under I, and succeeds or fails 
when vaUPj) - or 1 respectively. Otherwise val(P.) 

is undefined, and P diverges under I. 

Definition . Two program schemata P»Q are ( strongly ) 
equivalent . P S Q» if, for all interpretations I, 

either val(P-) - val(Q-) 

or both P and Q diverge under I* 
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This is a very stringent relation, but we shall 
alao study various weaker relations, which ore oil 
obtained by restricting the range of the quantification 
in the above definition. For convenience, the more 
important of these relations are defined here. It will 
be clear that the domain D of any interpretation may 
be taken to be countable and, without loss of generality, 
wc nay take for D either the set of natural numbers or 
the set of finite strings over a finite alphabet* A 
recursive interpretation over D is one for which all 
the functions assigned to symbols of the schemata are 
recursive functions over D. A finite interpretation 
is one which can be defined over a domain of finite 
cardinal. 

Definitions . Two program schemata P,Q are recursively 
(respectively finitely ) equivalent , P -r Q, 
(respectively P :f Q J if* for all recursive 
(respectively finite) interpretations I, 
either val(P T > ■ val(Q.) 
or both P and Q diverge under I. 
Two program schemata P,Q are weakly equivalent , P = <], 
if, for all interpretations X, 

val(P-) ■ val(Q_l when both are defined. 
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Note : " is not an equivalence relation, since an; two 
schemata are both weakly equivalent to a schema which 
diverges under all interpretations. 

The apparently weaker relations such as 
weak finite equivalence , ( P «t Q), obtained by 
restricting the last definition to finite interpretations* 
turn out to be the sane as weak equivalence . For example, 

Lemma . P *£ Q *> P a Q 

Proof + Suppose there is some interpretation I* for 

which both P and Q- converge, but P succeeds and 

Q. fails, then there is clearly same finite Interpretation 

I 1 , for which P T , succeeds and Q , fails. I 

Thus we have the three implications expressed by: 
P I Q -> P =r Q ■> P = 1 Q *> P » Q 
and we will show later [Theorem 5.2] that none of these 
implications can be reversed. 
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I 2.2 Justification of definitions . 

At this point it seems appropriate to explain 
our choice of definitions for equivalence etc. We 
have chosen to ignore the final values associated with 
the locations and to record only the final address* 
We regard a schema more as a (partially defined) 
predicate than as a function or transformation. We 
can show, however, that our definitions are not really 
any loss general, by demonstrating how other definitions 
and formalisms could be 'simulated 1 by our schemata. 

For example, suppose we select some subset 
R" I L ,... t L } of the locations, and call two 
schemata R-equivalcnt , if, under all interpretations, 
the schemata either both diverge, or both converge with 
matching final values associated with each location of 
R. (This definition, with R being the set of all the 
locations is used by Luckham and Park (7)]. Given any 

test-symbol T, and new function symbols F , . . . ,F , 

1 r x 

we may append at any terminal address of such a schema 

P, the sequence of instructions: (viz. p. 28 for notation), 

L :-F <L ) 
r l r l r l 
Itt ) e , HI 



X X X 

T <V e °' e * 
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to produce the schema P' say. It Is clear that, for 
all P,Q: 

P R-equivalent to Q<-> P 1 = Q' 
and that similar results hold for recursive, finite 
and weak equivalences. 

Instead of allowing a schema to halt, we may 
have it produce 'output 1 during the course of its 
computation. An 'output instruction' could be of the 
form: 

OUTPUT :- L. 

which leaves L. unchanged, but records its current 

value as the next in a sequence, or 'output tape*. Two 
schemata would be output -equivalent i f » under all 
interpretations, the corresponding output tapes chcy 
produce are identical. For tho simulation, we introduce 
a new location N and a new function symbol H, and 
replace each output Instruction of the above form by; 
N :» H(L ,H) 

T(N> e Q , *+l 

Again, two schemata are output-equivalent if and only if 
their transformations are equivalent. 

Similarly, suppose we allow 'input instructions 1 
in our formal language, such as: 



22 



L ;- INPUT 

with the effect that L. takes on Che next value from 
a fixed sequence of values, or 'input tape'. This pay 
be simulated by introducing a new location N* and a new 
function symbol F', and replacing each instruction of 
the above form by: 

Lj :- F'(N') 

W :- rw) 

so that the input tape is provided by: 

f<x), f(f(x>>, f(f(f(x)>), ... 
where f * I(F') and % * I(N'), By other constructions, 
many other, apparently new, features (such as statements 
of the form: *L :■ L ') may be simulated. 

Our definitions have their advantages in the 
ease of presentation of results and proofs, and in their 
simplicity. We note particularly that only two kinds 
of instruction are required. (For some purposes we even 
reduce this number!) The chief disadvantage is that, 
given two equivalent schemata, it is not usually an 
equivalence-preserving operation to replace one by 
the other as a pub-schema of a larger schema . However 
where ve are concerned more with decision problems 



22 



than simplification algorithms, this consideration is 
of less importance. 

j2.3 Syntactic characterizations , 

Now we establish an alternative, more syntactic, 
definition of equivalence, which is due to D. Luckham, 
who proved essentially this result in [BDN Memorandum, 
October 1963 (unpublished)]. To this end we define, 
for any schema P and any sequence i through ?, the 
free computation sequence . A(n) , which is a fiequence of 
m-tuples of strings of symbols, where m is the number 
of locations used in the schema. A(n) is defined 
inductively by: 

(1) A00<1) - < L^.,,1^ > 

(2) if Tf(i) is the prefix of the instruction: 

v r i<\ v 

then: 

A(ti)(l+l)(j) - AOOUMj) for j / v f 
and: 

A(w)(t+i)(v) - rjt x ...x v 

where X^ is the string A(it)(l)(v )• 
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(3) if n(i) is a transfer instruction than: 
A<*)<i+1) - A fr HO - 

For any sequence n and any transfer symbol T , 

u 

we define two sets of strings L Or) and ft {»} as 
follows: 

Consider the set of all i such that t{1> is 
the prefix of an instruction of the form, 

and 1) f c, then, 

AU)(i)(v) c L u (») if n (i+l) = b ( 
c ft (t) if n(i+l) - c . 

Definition . A set of sequences t (PJ ,T (0, ,n (a) , . . . 
through schemata P,Q»R> ... are consistent if, for all 
transfer symbols T , the sets: 

V* w > w V nCQ,) * — 

and ft U (P) ) u \(n <Q) ) O .... 



are disjoint. 

Theorem 2.1 For any two schemata, P E Q if and only 
if, for all pairs of sequences iT ' «nd t<^ through 
P and Q respectively, if r^ ' and ir" J are consistent 
then the values of r and t " J are equal or are 
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both undefined. 

Proof. Suppose that P = Q and that n ' and ¥ tQ * aro 
consistent. Let I be the Interpretation over the 

domain D of strings of function and location symbols , 

defined by: 

(1) I + a ± ) - L ± for each 1^, 

(ii) I + (Fj)(x lf ... > x t ) - Fy* r ..x c (i.e. the 

string obtained by concatenating the symbol 
F y and the strings x 1> ..,,x ) for each t\ 

(til) I + (T u )(x) - if x cl u (ir (P) )4iL u Cw CQ) ) 

-1 otherwise. 
Clearly t^™ - , (P) and i^W - „«»• » h «* 

*,+ is the execution sequence in R corresponding to 

1 , for R ■ P,Q, Therefore the values of ir ' and t'^' 

are equal or are both undefined. 

Conversely, suppose that P^Q and val(P T ) / val{Q_> 

for some I say. Any interpretation I, defines a natural 
mapping I from the symbol strings which occur in free 
computation sequences into the domain of l f by; 
(i) I*a t ) - IUj) 

(ii) iV^.^) - Iffjjl I*Oy t ..., l*(E c ) ] 

where E. t ..*,E are strings. 
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Clearly, for any execution sequence t_t 

I*< A(n I ) <i) (j) ) = A^^) (i) (j) for all 1, j 
So for each transfer symbol T , and all strings E, 
E dt^ij) "# KT u )[ I*(E) ] - 



and similarly for 2 and 1. Since this holds for both 
v and if t they must be consistent, but va know 

that they do not have the same value. 



Corollary . A sequence is consistent if and only if it 
is the execution sequence of some interpretation. 

An interpretation which has the form of I in the above 
proof is called a free interpretation^ 

It is natural now to attempt to find 'syntactic' 
definitions for the other relations defined above. To 
obtain analoguous results to Theorem 2.1 f we roust impose 
some further condition apart from consistency, on the 
pairs of sequences t^ 9 * considered. For weak 

equivalence the way is obvious and we state without 
proof: 

Theorem 2.2 P * Q if and only if, for all paiTS of 
finite sequences i and n w which are consistent, 

the values of i and ti are equal . 
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Definitions. A sequence n through a schema is ultimately 
periodic if there exist integers h>0, k > 0, such that, 
for all n j> h, t(d) - *{n+k) if these are defined* 
Note: any finite sequence Is ultimately periodic. 
A sequence through a schema P is recursive, if n t 
regarded as a function from the integers into the set 
of addresses of P is recursive. 

Theorem 2.3 For any schema P and interpretation I, 

<i) if I U finite then ir^** is ultimately 
periodic, 
(ii) if I is recursive then « * J is recursive. 

Proof, (i) Denote the (m+l)-vector. 



< It 



by o(l)* Since the number of addresses 
of P and the domain of 1 are both finite t 
there are only a finite number of distinct 
values for n{i). Furthermore it is easy to 
verify that if a(i) - a(j), then 
a(i+n) ■ a(j+u) for all o _> 0, so that a 
is ultimately periodic, 
(ii) The definition of n is clearly recursive 

in the functions of 1. Kence If these are 
recursive, so is n * . 
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Unfortunately, and perhaps surprisingly, the 
Implications in Theorem 2,3 do not hold in reverse, and 
the problems of finding syntactic characterizations of 
the execution sequences of finite and recursive inter- 
pretations remain open. An example of a recursive 
execution sequence which cannot correspond to any 
recursive interpretation will be produced in S5.1, but 
a simple example for the other case can be given here. 



Schece S >. 



M 2 :- HMj) 
L 2 :- H(M J ,M 2 ) 



M x :- F(Mj) 
M 2 :- P<Mj) 



M 2 :- F(M 2 ) 



T(L r > # r >1 



T(L 2 ) a, e x 



Informal notation. 

In the interests of clarity, ve employ certain 
conventions and contractions when ve give examples of 
scheoata. Unless otherwise specified, the first Instruction 
is the initial instruction, and the terminal addresses are 
always denoted by e* and e. respectively. Prefixes are 

in general omitted except where transfers ere involved, 



In which case the prefix is usually denoted by a small 
letter. The expression * *+l ' wherever it occurs as 
a transfer address denotes the prefix of the following 
instruction In the scheaa. The effect of a transfer 
instruction of the fora; 

W a - • 

is independent of u and 1 and will be called an 
unconditional transfer . Where it is convenient to avoid 
the arbitrary choice of u and i we introduce the 
statement: 

goto a 
as an abbreviation for such a transfer instruction. 
Minor points to note are chat subscripts and superscripts 
are often dropped, where this cannot lead to any confusion, 
and a variety of different capital letters are used as 
symbols. 

As we examine the schema S_ , we observe that 
there is only one infinite sequence through it. Let 1 
be some finite interpretation with this as its execution 



sequence and we write: 
.0 



for KHj) 



and f n+1 a for KFJl^al, n>0. 

At successive points In the computation sequence which 
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correspond to Cho address a of the schema, the 
successive values associated with the locations H. and 



M^ are: 



f 1 . f 3 a 

f 2 a f 5 a 

f 3 . f 7 a 



Since I is finite, not all these values are distinct, 
so suppose: 

f s+r a * r*a for some r,a > 

Therefore: 

r s ,s+r ,s+2r e 2s+t+l 

fa-f a - f a - .,. - f a 

for some t ^_ 0. Hence, putting u ■ s+t: 
e u ,2u+l 

When locations L- and Lj arc tested for the (u+1) tloe ' 
their associated values are: 

b x - I(H)[E U a.f u a] and b 2 - l(H)[f u a t f 2tffl a] 
To continue for another loop requires the conditions: 

I(T)[ bj - 1 and I(T) ( b 2 J = 
but since b. - b- f v* have a contradiction. Hence S. 

has the affirmed property. 
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§ 2.4 ' Reasonable' relations and rule-books . 

The relation of (strong) equivalence is a natural 
one and the strongest among the sort of relations we ore 
considering. Weak 'equivalence 1 is rather less natural, 
but it seems to be the weakest relation of interest, 
since it is difficult to reject, on any perfectly general 
grounds, any interpretation for which both the relevant 
schemata converge, or more precisely, to reject any 
consistent pair of finite execution sequences* Finite 
and recursive equivalence are again core naturally 
motivated, because we consider, for the former, just 
those interpretations which are theoretically realizable 
by a finite automaton (such as a digital computer with 
a limited amount of storage), and for the latter. Just 
those which could be realised by a Turing machine (or 
a digital computer with unrestricted storage space). 
Many other relations, which fall between strong and weak 
equivalence, could be of interest, and it is convenient 
to be able to prove, at one stroke, theorems about all 
such relations. 

Definition . A relation * on program schemata is reasonable, 
if, for all P,Q: 

(t) if P * Q then P * Q 
(ii) if P £ Q then P ^q 
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Weak equivalence is an intransitive reasonable relation. 
The relation S » defined by: 

P< Q if val(P x ) - v*l(Q I ) whenever vaKPj) ifl defined 

is an example of an asymmetric reasonable relation- 
Finite and recursive equivalences are, of course, reasonable 
equivalence relations. 

Me may expect any practical algorithm for 
improving the efficiencey of computer programs to include 
a set of rules which nay be applied locally, to some 
part of the program, to produce some improvement or 
rearrangement. We will give a few illustrations of such 
rules applied to schemata. In each case it is easy to 
see that the transformation results in an equivalent 
schema. 
(I). Permutation of computation instructions. 

h :m n h } i f L 3 : " G(L 2> 

L^ :- G(L 2 ) tj :- F(L 2 ) 

(II). Deletion of 'vacuous' instructions. 
\ :- F(L 2 ) 



\ :- fl<L 2 ) 



•} L x :- C(L 2 ) 



(III). Substitution and deletion. 
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L 2 :- F< Ll ) 



:- PCL^ 



L 3 :- F( Ll > 



i- C(L 3 ) 



L 2 := G(L 2 ) 



(IV). Removal of redundant tests. 





(v) . Removal of repeated tests. 
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(VI) » 'Unwinding' loops (or the reverse.) 




(VII). 'Tightening* of loops. 



4 






L 




|u ° C-( tQ 



> 
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As we will see in retrospect. It la Inappropriate 
to consider such transformations in more detail ac 
present. If "*- is any relation between schemata, a 
'v -rule i s a recursive procedure, which, when applied to 
a schema P, produces if anything a schema Q such that 
p*vQ. We are only interested in ^ -rules when *v Is 
a reasonable and transitive relation, for then we can 
apply a sequence of such rules and produce at each stage, 
a schema which * -related to the original schema. 
The transformations given above are all simple 3-rules, 
but we can have very much more complicated rules such 
as the (possibly non* terminating) construction of E(P) 
described in S3. 2. A rule^book for i is a finite set 
of ^ -rules; and we will want to see to what extent a 
rule-book can serve to produce a 'simplest' schema 
^ -related to a given schema. Firstly though we show 
how a 'direct' decision procedure and simplification 
algorithm can be obtained for a special class of schemata. 
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|3. SOME SIMPLE EQUIVALENCE PROBLEMS . 

53.0 Karill [81 has considered some of the problems 
of sitapli f ication and storage minimization for schemata 
with input and output statements but with no transfer 
instructions. Wc now give a complete decision procedure 
for the equivalence of schemata of a more general type, 
schemata without loops, and show that this can be extended 
to cover any schemata which always halt. We observe 
that, for such schemata, strong and weak equivalence, and 
hence all reasonable relations, are the sane, so the 
decision problem for any reasonable relation is solvable. 
It will be evident that a 'canonical form 1 for schemata 
without loops can be obtained and ve do not pursue this 
further. The section ends with an unsolved problem 
presented by a certain simple schema* 
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3.1 Loop-free schemat a, 

A loop^free schema is one for which no sequence 
can contain the sarae address twice. For these schemata, 
the equivalence problem is, in a sense, finite, and 
fairly easily solvable, so we will give only a sketch 
of the solution together with an illustrative example. 

Since all sequences through a loop-free schema P, 
^re bounded by the size of P, there are only a finite 
number which go from the initial instruction to the 
left-terminal address e Q . For any such sequence *, we 
determine the finite sets JL (v) and ■ : (ir) , for all 
test symbol* T . Any II then, corresponds to a conjunction 
of conditions, each of which may be expressed in the form: 

W - e 

where E» ±u a string of symbols and c is or 1. The 
condition (on the interpretation) that P succeeds is 
a disjunction of such conjunctions, with one conjunction 
corresponding to each successful sequence. If we write 
just: 

then the disjunction, let us call it prop(P) , appears 
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as a sentence of the prepositional calculus with the 

'T (E.)' as atomic formulae, 
w J 

Since all the functions are independent, any 
assignment of truth-values to the atomic formulae is the 
assignment corresponding to some interpretation* and P 
succeeds under this interpretation if and only if the 
value of prop(P) » after substituting in these truth- 
values, is 'true 1 . Given two such sentences, prop(P) 
and prop{Q); 

prop(P) <-> prop<Q) 

is a theorem of the prepositional calculufi, if and only 

if they take the same truth-value for all such assignments * 

Hence: 

Theorem 3.1 The equivalence problem for loop-free 

schemata i& solvable. 

For an example of the application of the solution 
described, we consider the schema S-, given below, whoso 
flow-diagram is given in a self-explanatory form In fig, 1, 
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Schema S ^« 



L 3 :- G(L 2 ) 
T<Lj) a 3 , a 2 E x 

V T<L 3> V a 5 h 

ay L 3 :- FCLj.Lj) 

T(L 3 > a 4 , ^ E 3 



a fi . L 2 :- G(L 2 ) 

T(L 2 ) -7. ^ 



a L := FO^) 



T(L 3 ) n , a fi 



a 6 . L 2 :- G(L 2 ) 



a ? . L 3 :- Faj.Lj) 
T(L 3 ) e Q . *j 



The sequence* from a. to e Q are specified by: 



Pl :- VWVd 



WW e D 



p 3 :- ai , a2 ,a 5 ,a 6 ,a 7 , eo 
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aJu'.BF(uu) 





»*• 1- £ 2 



Q f . U ■■ P(l>.l,) 




a?. 



L , =F(L..L t )[ 



i 



!_,: = Fa.,L,) 
Ut* &(L.) 





»«• 2. S 3 
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Let Ej - PLjLj , E 2 - GL 2 , and E 3 - FIL^O.,. 

pTop{S.) Is cbea (the notes to the right of the schema 
nay bo of assistance): 

tOtf ~ T{E 3 > " T(E 2 ) " T(E 3 ) 

V 1*0^) " T<E 2 ) " T<E 2 ) " T(E 3 ) 

V H T ( E i) *-?0y ^T(E 3 ) ' T(E 3 ) 
which simplifies to: 

T(E 2 ) " T(E 3 ) 

S 2 is therefore equivalent to the schema S_ given as: 
Lj I- Faj.Lj) 
L 2 ;- G(L 2 ) 

T(L 2 > -+1, ej 

^ :- F(L 1( L 2 ) 

ia x 3 e , . x 

The flov-diagraa o£ S- is shown in fig. 2. 
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S 3. 2 Schemata which always converge, 

LetC be the class of schemata which converge 
under all interpretations. Wc shall give an effective 
method of constructing, from any schema in C, a finite 
loop-free schema which is equivalent to it. This 
solves the equivalence problems for C, since we solved 
the general problem for loop-free schemata in §3.1 

Any schema may be regarded as a directed graph, 
its flow-diagram , in a natural way. To each computation 
instruction there corresponds a node of the flow-diagram 
with one branch leaving it, and to each transfer 
instruction, a node with two branches leaving. Given 
any flow-diagram P, we define the lift of P . L(P) , which 
is a tree with a copy of the initial node of P as the 
root, and for every node N Q in P with successor(s) N. 

(and K 2 ) , every copy of N Q in L(P) has copies of K. (and 

tf 2 ) as successor(s) in the same way. Unless P is 
loop-free, L(P) is infinite, Vc illustrate the construction 
of L(P) by an example, (figurss 3 and 4 
below). Part of the (Infinite) lift of a schema S, ia 
shown and dotted lines indicate where parts are omitted. 
In these figures, and in the remaining figures of this 
section, we abbreviate: 



43 




and: 



i 



Li'cRW 





We can 'prune 1 away parts of the tree L(P), 
replacing conditional transfers by unconditional transfers 
wherever certain branches are inaccessible froo the 
root under any interpretation, leaving an equivalent 
flow-diagram. In fig, 4. such branches are crossed by 
double lines, as for instance the first tine that 
location L. is tested, the value associated with lc is 
one that has previously been tested in L. and found co 
have test-value 0, so chat the crossed branch can never 
be selected, The sub-tree of L(P) remaining when all 
such excisions are nade, we call the execution tree of P 
E(P) . For the schema S, the execution tree happens to 
be finite and is presented in fig. 5. 
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fig. 3. Floy-diagram of 1' , 



L»:'F(L t ) <- 








Pnri of i.(P) . 



kS 




El?). 



At> 



The construction of E(P) for a general schema P, 
is described now in more detail. Given the lift of P t 
we mark the initial node. The construction is then 
carried out in steps until, maybe, no further step ±a 
possible. At any step* we select any unmarked node N, 
whose predecessor is marked. If N is a computation 
node or a terminal node, we immediately mark it and the 
step is complete. If N is a tost node, we examine the 
(unique, finite) path from the Initial node to N and 
determine what expression (of the free computation 
sequence) is being tested at N. We further determine 

whether that expression has been calculated and tested 
by the sane test function at any previous node on that 
path. If not, H is marked; if so, then by the constuction 
it will have been tested just once and we can anticipate 
the result this time. We replace N by a marked 'empty 1 
node and delete the inaccessible succeeding node and 
its following sub-tree. The step is then cotaplete. 

Suppose the construction does not terminate, then 
the execution tree contains infinitely many nodes which 
are accessible from the initial node, b~ say. One or 
other of the successors of b , say b. , must also have an 
infinite number of nodes accessible from it. Similarly 
some b^. a successor of b t and some b . a successor of 
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b , and so on, each have an infinite number of nodes 
accessible fro« them. Therefore there is an infinite 
path b Qt b. ^2^,,.. in the* execution tree. (This 
is an application of the well-known Infinity Lemma of 
graph theory,) By the construction* any path in this 
tree is consistent and so, by the corollary to Theorem 
2*1* is an execution sequence of P. Therefore, if P 
belongs to C, the construction of E{P) must always 
terminate. In this case, there is an effective, finite, 
procedure which produces the execution tree corresponding 
to P, and It Is then easy to produce from this, a schema 
In conventional form, by ignoring empty nodes and 
introducing unconditional transfers where necessary. 
E(P) is equivalent to P and loop-free, and we have a 
decision procedure for the equivalence of such schemata, 
{Theorem 3.1)* Hence: 

Theorem 3.2 The equivalence problem for schemata 
which converge under all interpretations is solvable. 

We conclude this section by demons crating that, 
even in apparently simple cases, it is not always a 
trivial matter to determine whether or not a given 
schema is in the class E. The following schema always 
converges, but has an execution sequence of length 147. 
(see fig. 6) 
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L:= Km) 




fi£^2 



fin, j . 



i 



M^ F{n) 
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The schema in fig, 7. seems likely 'on general grounds* 
to be in C f but execution sequences of lengths greater 
than 600 have been found. I have been unable to settle 
this matter, which is left as an exercise for the reader. 
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PART IK UNSOLVABLE PROBLEMS . 

Luckham and Park [7] first proved the 
unsolvability of Che equivalence problem for program 
schemata and pointed out that this precluded the possi- 
bility of there being any 'completely adequate* 
simplifying transformation. Their proof was rather less 
direct than that presented here and involved the use of 
oany locations. R. Floyd (unpublished communication] also 
has recently produced an elegant 'indirect 1 proof of the 
above result. 

We prove somewhat stronger results about other 
reasonable relations and for schemata satisfying various 
conditions- Our unsolvability results are proved for 
schemata with only two locations and in Part III we shall 
see that this number is the best possible. 

In the next section we apparently digress to 
consider a variety of finite autoaata. 
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$U. TTO-TAPE AKD TVO~HEADED AUTOMATA 

14^0 Our unsolvability proofs for schemata depend on 
a class of schemata which can be regarded as 'simulating' 
two-headed finite automata, (The reader will have 
already had a taste of this simulation in following 
the execution of the schemata given in §3.) Although 
we could preceed independently of such analogues, it 
seems more natural and gives us greater versatility 
of exposition to derive our main theorems initially In 
this guise. Besides, scne of our results take on an 
extra Interest in their 'machine' form. 

Rosenberg [14} has independently obtained 
several interesting results on multi-headed automata* 
including what is essentially our Theorem £.2. The 
questions considered by Elgot and Rutledge [3] are related 
but do not seem to be directly applicable in the context 
of schemata. 

In this section we establish the basic theorems 
on the correspondence with Turing machines and prepare 
for the simiiation of two-headed automata by program 
schemata. Theorem 4.4 could perhaps be endowed with 
some philosophical significance. 
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$ 4 « 1 Tvo-tapg automata 

Rabin and Scott to [13J investigated two-tape, 
one-way finite automata* Such a two-tape automaton M» 
is a finite automaton equipped with two scanning-head* 
each of which reads its own input tape, one symbol at a 
time. With each internal state of K (except the terminal 
state or states) is associated an integer specifying the 
tape from which the next symbol is to be read, and a 
transition function giving, for each tape-symbol road, 
the next state of H to be entered. One state of H is 
designated ss the initial state* The reader is referred 

to [13] for definitions given in more detail. 

Our treatment differs in that we dispense with 
special 'end-markers 1 on the tapes and consider the 
behaviour of automata presented with pairs of infinite 
tapes. 

For any autcuaton M and terminal state s^ we 
define T (M) as the set of pairs of (finite) sequences 
of input symbols, which take M from the initial state to 
the state 8. Often the automaton M v will have Just two 
terminal states, denoted by a_ and -_. then, it is convenient 
to say that M accepts those pairs of tapes with initial 
segments in T (M) , and rejects those with initial segments 

in T r (M) . M will diverge on the set T_(K) of those pairs 
of tapes for which M never enters a terminal state. 
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We shall present particular automata by specify- 
ing all or part of the transition function either in a 
table or illustrated in a flow-diagram* In the latter 
the initial state is indicated by an arrow which does not 
originate at any of the nodes, and the ni«ber in parentheses 
at a node indicates which tape is to be read for the next 
symbol. For example: 




^pfe) 



* r 



fig. 1. Flow-diagram _ o _f M. « 
It nay be verified that: 

w 

,Q+1 -fll 



{ < l"0 t i^O > m>D > 



VMj) - { < 1*", l n D>, <l n Q, l m 0>, 

nr+2 



W " 



< ro, l*-* > »>o > 



{ < i" , r» > 
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We now describe the construction of an important 
class of two-tope automata. 

Lemma 1* Given any Turing machine U» there is a two-tape 
automaton H..* with terminal states n t ji and r, such 
that, if c is a configuration of U, and o 1 is the 
succeeding configuration ( see §1), 

< c* c 1 > e X (JL.) if c is not terminal, 

< c, c 1 > e T a <Hy> " c is terminal, 

and if the second sequence does not start with c', the 
pair is rejected. 

Proof. We outline the operation of M... Given a 
configuration under Head 1, it checks that the tape 
under Head 2 is identical, except for an extra symbol 
at the left end of the (Turing machine) tape, until 
some state symbol of U is encountered on either tape. 
M.. then verifies that the difference between the con- 
figurations is compatible with the rules of 0, and 
finally checks that the remainders of the configurations 
arc identical up until the final terminating symbol e, 
except again for an additional on the second tape. The 
transition table for M„ is given in fig. 2. Where a 
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fig- 2 Transition table for a 2-tape automaton M.,, to 
check successive configurations of a Turing machine U. 

state head symbol new state 

«i <v -J 

c I c. -J 

[x,q r x] 1 q A [xq^o^xl 

[q i a j- _1 2 a j [ VrV 

[JC^j^x] 1 Oj [a r -] If • q 4 flj L ^ ■ 1b in 0. 

IqjBj.a^) 2 q r [ -, -] if ' q^ a R q f • is in U. 

[l(*j.q r ] 2 a^ I -, -] if ' q ± a. a^ q^ ' is in U. 

I -, -J* if ' q t a H q ' is in 0, i ■ 



"i 

I -. -]* 1 a^ [a jt -]* 



No tea . 

1. The initial state is 



[ c, -]* 2 a^ I Ei 0]* 2- Transitions not given 

[ c, 0]* 2 r a art transitions to r, 

[ e, -]. 2 a [ C) ] 3 - The quantification 'for 

[ c, 0] 2 e n^ a11 Mr* ' is su PP°"d 

for each entry here. 
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transition is not given, a transition to the terminal 
state r_ is to be assumed. The names of the states are 
supposed to have mnemonic value, and the proof that M» 
has the required properties is by inspection. / 

5 *. 2 Two-headed automata . 

A two-headed (one-tape) automaton is identical 
with a two-tape automaton except that the two scanning 
heads are regarded as reading independently the same 
input tape. Alternatively, we can regard it as a 
two-tape *ut9Mtgn which is always presented with 
identical pairs oE tapes. For any two-tape automaton M 
with terminal states _ and v_ t the corresponding 
two-headed automaton M 1 is said to accept (respectively 
reject » diverge on) those tapes t such that: 

< t r t 2 > e T fl (M) (respectively T^M). T D (M) ) 

for some Initial segments t.,t 3 of t. 

Given a nmber of two-tape (similarly two-headed) 
automata M.,K 9J .,,, wo can construct a new automaton M, 
by replacing terminal states of the M , , where they occur 



in transition cables, by either new terminal, states or 
the initial states of aoeae H . The initial state of 
one of the M is designated as the initial state of H. 



For example! if: 

l 



n\ 



1 



n 



A 



a 



I 



(tf 



denotes 



\ 



ei 



is the autooaton of fig, l, l then: 



€7 




^i i ntH 



fig. 3. F 
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This automaton F f could of course be simplified by 
caking 6 as the initial state and deleting the state a*. 
F f regarded as a two-headed automaton , can only diverge 
on the tape: 

0101101110111101111101111110 

The behaviour of F on such a tape is that, whilst Head 1 
is scanning each sequence of l's t Head 2 Is scanning the 
succeeding sequence and, unless this contains precisely 
one more 1, the state ± is reached. 

Th eorem 4.1 There is a cvo-headed automaton F» which 
diverges on some tape but rejects any ultimately periodic 
tape. 

A consideration of the modus operandi of F serves 
as an introduction to Lemma 2. Given any Turing machine 
U, and initial configuration c (• 0(0)), let C(l) t C(2) . 
C(3),.... be the sequence of successive configurations. 

Lemma 2 » There is a two-headed autcaaton M.. which 

accepts Just those tapes with an initial segment of the 
form: 

C{0) C(l) C(2) C(3) .... C(n) C(n+1) 
where C(n) is a terminal configuration of U (and the 
first in the sequence). If machine U does not terminate 
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when started from configuration c, then H-. cannot 
accept any tape, but diverges on the tape: 

C(0) C<1) C(2) C<3> .... 
Otherwise M-. never diverge*. 



Proof. It is a simple matter to construct, for any 
configuration c, an automaton which reads only fron 
Head 2, and accepts only the sequence c, rejecting 
anything else. We denote this by: 



i 



H c 






He combine this with the automaton M_ described above, 
to produce K.. in the following way: 

1 




frG 



Because of the operation of M , any tape which does not 
conmcnce with C(0) is rojected* Head 1 then scans this 
C(0) while Hoad 2 reads the next part of the tape. The 
properties of M ensure that if this next part is not 
C(l) # the tape is rejected. After this stage, Head 1 
is at the start of C(l) and Head 2 is again ready to 
scan the next part of the tape, While Head 1 is scanning 
any configuration. Head 2 scans the subsequent part to 
check that it represents the succeeding configuration. 
These operations continue until a check fails or until 
a terminal configuration is read by Head 1. / 

The rr em 4*2 For any Turing machine U, there is an 

effective construction of a two-headed automaton A^ 
with the following properties: 

(i) if U halts when started on a blank tape, 
there is no tape on which A^ can diverge but It can 
accept some tape* 

(li) If U diverges when started on a blank tape. 
A., can diverge but cannot accept any tape, 

(ill) Ay cannot diverge on any ultimately 
periodic tape. 

Proof . If c' « q,Oc, then A^ is the automaton M- p , 
whose construction i* clearly effective from 0, and 
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which, by Lemma 2, satisfies (ij and (il). Property (ill) 
ip satisfied because A^ can only diverge on a tape which 
represents a sequence of successive configurations. 
On such tapes however, the cnd-of-coni iguratlon symbol e . 
occurs only at steadily increasing intervals, and so no 
such tape can be ultimately periodic* / 

With the help of Theorem A, ve immediately derive: 
Theorem 6.3 For two-headed automata A, the properties: 
<i) A accepts no tape, ( T fl (A) * J , 
(ii) A diverge* on some tape, [ T D (A) t J. 
are not partially solvable, even when restricted to 
automata which converge on all ultimately periodic tapes. 

We remark that the negations of these predicates are both 
partially solvable. This is obvious for: 

V 

For the other, the method of proof is very similar to 
that used for the corresponding result about program 
schemata. There is a procedure which generates, for 
any A, a tree whose nodes are states of A, such that 
any sequence of states describing an initial portion of 
the behavior of A en seme tape, corresponds to some 
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path in tho trea starting from the root, and vice versa. 
There are at most finitely many outgoing branches at 
each node, so that, if A never diverges and all the paths 
in this tree are finite, the infinity lemma assures us 
that there are only a finite number of nodes in the tree* 
In this case the generating procedure terminates, and 
so provides, in general, a partial decision procedure 
for the propertyi 

T D (A) - & 

i4.3 Recursive tapes . 

The next four lemmata prepare the way for an 
even stronger theorem, in which ve replace 'ultimately 
periodic' above by 'recursive 1 . A tape Is recursive If 
the sequence of symbols is given by a recursive function 
from the positive integers to some finite alphabet. 

Let n denote a sequence of n l T g followed by a 0. 
Lemma 3 . There is a two-tape automaton N which, for 
arbitrary n >0, when presented with a sequence: 

n n-1 ... 10 
under Head 1, will accept only the sequence: 
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O, 



n+2 n+1 ... 2 2 10 
under Head 2, rejecting all others. 

Proof. H 1b illustrated. Any transition not shown ts to r. 

l 

' \ 

£(0 fc^ JdJ fl R . A. H . 



Let U be aooe fixed* universal Turing Machine, 
(Theorem B,S1). We denote by * C Q \ the initial 

configuration of U started on n, and by ' C \ the 

■" nm 

configuration resulting after a steps of the computation. 



Lemma 4 . There is a two-tape automaton C n » which, gi 



1 
the sequence n under Head 1, can accept either of the 

sequences: 

' C n0 ° b °' a SiO 1 b 

where a,b are new symbols, and rejects any other. 

p roo£. The construction of C Q is easy. _/ 



i.vtr. 
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Leiaaa 5 . There Is a two-tape automaton H* which, given 

Che sequence: 

C & for arbitrary n.m > 0, and fi ■ or 1 
under Head 1, can accept the sequence: 

C n<m-1) 6 

under Head 2, lust when C is non-terminal, or when 

nm 

C represents a halt on the symbol $ (- 0,1), and 
rejects anything else. 



Proof, It is a simple matter to modify the automaton M,,. 
of Lemma 1, so that the terminal state a, corresponding 
to a terminal configuration, is replaced by two states* 
a* and a, , which are reached according co whether the 
configuration under Head 1 represent* a hale on a or 
a 1 respectively. If this new automaton is :L.\ then 
M* is given by: 




(*J 



<:\ 






■^ — >v<0 



m- 5. 




V(iV 
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Unrna 6. If U is a universal TUring machine , as 
described in Iheorem D, there is no recursive function f , 
such that: 

C[U,n] = =>f(n) - 

and C[U,n] = 1 =? f(n) - 1 

Proof. Suppose such an r is recursive, then, by Iheorem D, 
there is a machine M, and say M = M^ the m^ In the 
enumeration, such that: 

C[\.n] - 1 if f(n) = 

and c C t V n 3 = if f(n) j> 

Then; 

f(m) - =2> C[>L,n] - 1 

=* C[U> ] = 1 

=2> f (m) ■ 1 

and: f(m) * ^ C[l^,m] - 

=» C[U,m ] = 
=^f(m) =0 

Ihese contradictions prove the lenro. / 

Ihis result is of cours* well-known, but proved here for 
convenience. 

Consider nc« the wo*-beaded autcmaton S» whcee flow-diagram 
is given belcw (with transitions to r suppressed). 



c< 



W->/3(^-- >VU)— > JW 
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flfi.6. Flow-diagram of R . 

H rejects any tape not of the form: 

bl0aC 1Q 6 1 b2 10aC 

which is more conveniently displayed as: 

b 1 



Cjd «i b i i a 



11 " 



C U «i a 



"12 °1 

C 13 a l 



a C 30 a 3 

a C^ «„ b £ i £ 2 1 



C a « 2 a C 30 a 3 b i 1 2 l o 



C 22 4 2 



C 31 *3 



C 1H 6 1 a c 23 *2 a C 32 6 3 a Si 6 « a C 50 *5 b - £ 



where 6, ■ or 1 for 1 ■ 1,2,... 

Furthermore, Mg is designed to ensure that R can only 

diverge on such a tape If the sequence: 

6 i**2**3'"" 
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is such that the function f, defined by: 

f(n) - a n for n = 1,2,3,... 
satisfies the conditions described in lepra 6. Suppose that 
R diverges on some tape defined by a recursive function, g say. 
Ihe sequence of 8's is recursive in g, to be precise: 



f(n) - J n = g( ?/3(n 5 +6n ti +8n) } 



Therefore f is recursive , which contradicts Lenma 6. 
Hence: 

7::e o:^'j -.■' There is a two-headed automates R, which 
rejects any recursive tape, but can diverge on scne (nan- 
recursive) tape. 

For any Turing machine M, we can nxxlilV R so that tapes it 
does not reject have the form of the triangular representation above, 
except that at the beglJininss of successive rcws there appear 
the successive configurations corresponding to H started en blaric 
tape. This new automaton checks that the tape has this form, 
but accepts the tape if a teiminal configuration of H is ever 
reached. Again, it cannot diverge on any recursive tape, so 
vie have; 



0& 



Theorem 4.5 For any Turing machine U, there is an 

effective contraction of a two-beaded autcxnaton H„, 
such that: 

(i) If U halts fron a blank tape, there is no 
tape on which R^ can diverge, but it can accept sore tape. 

(II) if U diverges from blank tape, Hy cannot 
accept any tape, but can diverge, 

(iii) Ry cannot diverge on any recursive tape. 



5*1.4 Translation to binary automata * 

The theorems of this section have been proved 
for automata with large (but finite) alphabets, but it 
is more convenient to set up a correspondence between 
program scbeirata and binary automata, that is autccnata 
over an alphabet 10,1}* 

3g3SgLS*6 Theorems 4.1 to 1.5 are valid even if the 
autan&ta are restricted to being binary. 

FTcof. Let us rewrite Che alphabet, (a,b f e f l > > ...} > 
we have been using as: 

S- (s , Sl ,s 2 ,..., Sh } 
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and let p be the map from sequences on S to sequences 
cm (Q t l), which is obtained by replacing: 
8 by 



& x by 10 
s-, by 120 



etc. 



For any autaraton A over S, let p'(A) be the binary autamton 
obtained by introducing, for each state q, 
the new states q" 



,q ,...,q , and by replacing each 



line of the table giving the transition function of A: 

next state 



state 



Head 



S Q 3 i 3 2 • 



• "h 



r r l r 2 • ' ' r h 



3-3i'i by the new lines: 



state 



Head 



.h-1 



1 
1 



: 
1 



next state 
1 



r q 



p h-l* 
r h E 



where we recall that r is the reject state. It is easy 
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to verify that the automaton A will accept, reject , 
or diverge on* any tape t, precisely as the automaton 
p*<A) accepts, rejects, or diverges on, the tape p(t), 
respectively. Also, for any binary tape t 1 , not of the 
form p(t) for any t, 

(1) if some p'(A) accepts t 1 , then it also 
accepts sore tape p(t), 
(ii) no p'tA) diTOrgeson t*. 
Ptor any tape t over 5, 

(1) t is ultimately periodic if and only if 
■p(t) is ultimtely periodic, 

(2) t is recursive if and only if p(t) is 
recursive. 

We can therefore 'translate 1 each of the theorene in this section. / 
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55. UNS0LVA3ILny OF PROBLEMS OF PROSKAM SCHET'ViTA . 

» 5.0 The first two leranata show that the two-headed 
autanata of the last section can be 'simulated 1 by 
schemata, and provide an effective 'translation' procedure. 
We prove that recursive equivalence is properly 
weaker than strong equivalence and also that recursive 
interpretations are not characterized by their recursive 
execution sequences. Theorems 5-5 and 5.6 answer all 
the solvability questions of the kind we are considering. 
Finally we strengthen one of these results to show that, 
under certain very acceptable hypotheses, there can be 
no ' adequate' rule-book of sinplifying transformations . 
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§5 »1 Simulation and first results. 

We have only to sho* that any two-headed binaiy 
automaton can be 'sijailated'. In a suitable seree, by 
sane program schema, to be able to apply the theorems 
of 54. Suppose that our formal language contains at 
least cne test-synbol T, and at least era ftjnction syntool. 
We can assume, without Ices of generality, that there is 
seme monadic function synfool F, since otherwise we could 
take any function syntiol and use it as a monadic function 
by always using it with all of its argunents Identical. 
We require that the language ccntains at least two 
location symbols, Lj aixl Lj. For any interpretation I, 
of the language, let: 

Xq = I(P)[ 1(^)3 
and x^ = I(F)[ x n ] f Or all n >_ 

We define the tape of I , c», as the sequence: 

£ 1* € 2**3* * * ' vftiere E n * J * T ^ *n ^ for n - X 

For any (infinite) binary tape s, there is clearly an 
interpretation I, such that s « c^. 
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ignraa 1, (i) A tape s, is ultimately periodic If and only 
ir there Is a finite interpretation I such that s ■ e-* 

(ii) A tape s, is recursive if and only if there 
is a recursive interpretatlcn I, such that s » c- t 



Proof . The sufficiency is innediate in each case. As 

far the necessities, in (i), suppose that: 

B-V 2 

and for seme h^O, k^, ani for all n>0, 

"W+n B *h+n 

Let I be an interpretaticn over the donain of non-negative 
integers less than h+k such that : 

1(F) [K^J ]= 

KF) [m] = m*l if m < h+k-1 
= h if m = h+k-1 

and I(T}[m] ■ 6^ for all m in the decain. 
Clearly s * £,. For (11), sur^ose that s is defined by 
the recursive function f , then s is the tape of an inter- 
pretaticn I, whose decrain is the rravnegative integers, for 
which: 

KfHKLj)] = andI(F)fo]- mfl 

I(T)[m] = f(n>, / 
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Given a two-handed i^:.-i:y n\ Amazon .-,, Me zho.} 
hew to constmct a program schema PfA), such that the 
behaviour of A started on any tape s, will be paralleled 
by the execution of P(A) under any interpretation 1, 
for which s ■ Cj. At parallel stages in the cperatlcxis 
of A and P(A), as A reads the n th syntool & , of s with 
Head 1, P(A) ccaiputes the value x^ as defined above, 
assigns It to h^ and applies to it the test function I(T), 
thus obtaining the value * n . In describing the schema 
P(A), we shall prefix certain of the Instructions with 
synfcols qQ>qp...> corresponding to states of A, and 
tten use these synfcols as transfer addresses in instructions. 
Ihe first instructions of P(A) are: 

h := F(L i> 

after the execution of which under I, each location 
holds the value x Q . The next instruction of P(A) is 
that which is prefixed with the initial state of A. 
For each state q of A, which reads with Head i and has 
transitions to states q 1 , q n , accoitiij^ to whether the 
syntool read is 0,1 respectively, we have the two 
instructions of P(A): 

a. I^ := Fd^) 
Tt^) q\q" 
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The left- and right-terminal addresses are a and £ 
respectively. Which are replaced by e fl and e^ as usual. 
This concludes the description of F(A). 

To illustrate this construction, suppose A r Is 
the 5-state autonaton shewn belcw, with initial state q~* 



> V (o s > r 




Automaton A 1 - 



The schema P(A') is the following: 



Lj :• PC^) 

TClj) q a , e a 

q r 1^- := FCLj) 

TC^) e , q 2 
q 2 - 1^ := F(L 2 ) 

T(l^) q^ q x 
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Note: P{A f ) can only diverge under an interpretation 
whose tape is of the form: 

010U0112011U01U110U11110 

It succeeds under an interpretation which breaks this 
sequence with an unexpected O f and fails under interpret- 
ations which ^.ve an extra 1. 

I^rrna 2, For any binary tw>-heaied automaton A, and 
binary tape s • Cj , A accepts* rejects, diverges on, s 
according as P(A) succeeds, fails, diverges under 
interpretation I. 

Proof. Irmediate, frcen the construction of P(A). _y 

Using Lentnata 1 and 2 and Theorem 4,6, we can 
'translate* Theorems 4.1 ard 4.2 to obtain: 

Toeorcra 6*1 There are program schamata Z* and Z^ 
such that: 

(I) Z diverges under sore Interpretation but 
fails under any finite interpretation. 
(II) Z diverges under sorce interpretation but 
fails under any recursive interpretation. 

With the aid of these results we can settle several 
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questions which were mentioned in 52. We promised to 
show: 

'Theorem 5.2 Tte sequence of relation: 

is one of strictly increasing strength. 

Pefinitlcre, Let: 

Eq denote the schema ! goto e Q ' 
^ n ii « , goto e ^ , 

D it ,i ,. , &# g^ a , 

Eq always succeeds, Z^ always fails* and D always diverges, 

Proof of Theorem 5.2 We note: 

(a) ^ * D but E 1 if D 

(b) ^ =f 3? but E^ *r 7? 

(c) Ej =r Z R but E^ i Z R _/ 

We can ncdify Z n to produce a camter-exanple to the 
tenptir€ Exposition that any recursive execution sequence 
through a schema is the sequence corresponding to sere 
recursive interpretation. In terms of sutanata* we renove 
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the sequence of 6's from the main tape, and put It 
instead on a second tape. The autcenaton still demands 
that this sequence predict on which syntool, if any, 
each TXiring machine halts, and go can only diverge if 
the second tape is ncff^-recursive . However the automaton 
is desigred so that it does not take acccunt of a symbol 
on this tape until after the corresponding Turing machine 
has halted. In this way, the sequence of states when 
the autanaton diverges remains recursive. In terms of 
schemata, we Introduce two new locations K. and M~, and 
it is the 'tape of I 1 correspond^ to the location ^ 
which is the 6-sequence- It is used as a 'scanning head 1 , 
while M, stays constant. The instructions of Z^ concerned 
with 'reading the 5-sequence 1 are deleted, arri in their 
place: 

(i) whenever 'the syafcol b is read by Head l 1 
(In autorata tern©) the instruction: 

1*2 := W\) 

is executed, which corresponds to re-setting the scanning- 
head at the beginning of the tape, and then the original 
course of instructions is resumed, 

(ii) in similarly hybrid terms, the schema/ 
automaton M, , whose defining diagram was fig. 6 (|4.3) 
is replaced by: 
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> o, 



>:-j:v 




translates to ' ^ :« Ffiy 

T(l^) y, 2 ■ 



Bius, for any interpretation under which the new autanatcn 
diverges, the value or each 6 has no effect on the 
execution sequence until the simulation of the n 
Turing machine confutation has halted, artf so the 
(unique) divergent execution sequence is just a 'diagcnsl 1 
simulation of U started on every n in turn, and is 
clearly recursive. Ihere are interpretations I, uith 
this divergent execution sequence but since the tape of 
I corresponding to location M, has to be non-recursive, 
by lemma 1 any such I is non-recursive. 

Recalling the results of §2.3, we have: 



Tneoreai 5*3 

(I) Any finite interpretation gives rise to an 
ultimately periodic execution sequence, but there are 
ultijnately periodic execution sequences which do not 



correspond to any finite interpretation. 

(U) Any recursive interpretation gives rise to 
a recursive execution sequence, but there are recursive 
execution sequences which do not correspond to any 
recursive interpretation. 



S5.2 Decision ^rc:LQjj:^ for equivalences, 

Theorems ^,2 and 4*5 translated into program 
schemata terms, yield: 

Ttoorem $.*) There is an effective construction which 
produces , for any Turing rachine U, program scherata 
Py ard P„ r with the properties; 

(i) If U halts fran a blank initial tape, there 
1* no interpretation unier which either ?» or P-T* can 
diverge, but each can succeed under sane interpretation, 

(ii) If U diverges fran a blank tape, neither can 
succeed under any interpretation, but each can diverge 
under sote Interpretation 

(ill) P y converges urrier all recursive interpret- 
ations, but, in case (ii) above, P y can diverge under a 
recursive interpretation. 
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We shall use the synfcols P,Q as variables to 
range over the class of program schemata. In a mild 
abuse of language, we Shall write, for example: 

1 P^f Q is partially solvable * 

to rrean: 

1 There is a recursive procedure vftichj for all 
pairs of schemata P*Q* when applied to P ard Q, 
terminates if and only if PrfQ/ 

We write a for exanpie: 



' P (" r > Q ' 



as an abbreviation for: 

1 P^Q for all relations -v , satisfying, 
(i) et =^ -v 
(11) * z%> * * 

Thus O stands for all reasonable relations. 

lanre 3 * (i) P ff Q is partially solvable, 
<ii) P /Q is partially solvable. 

Proof. For any finite interpretation I, and schema P^ 
sinoe each location can only take on a finite nunsber of 
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different values, the ccnputation sequence A-(«-™') 
nust either terminate or repeat itself- So it is 
determinable In a finite nurtjer of operations whether 
vaKPj) is 0,1 or undefined. We showed in S2,l that 
= is the same as *f . The set of finite interpretations 
Is effectively enumerable, and the partial procedure 
for both $£ and /f consists in investi^ting the 
values of Pj and Qt for each I in turn, terminating 
only when a suitable disparity is found. _/ 

LemajL (i) P a D is true for all P. 

(11) P ij f ) D is partially solvable. 

Proof of (11) . If P / D , then ? converges under 
both interpretation, and therefore also under seme 
finite interpretation. The result follows frcci 
ZjEuunb 3* / 

Lretna 5 . P (! f > D is not partially solvable, 

Preof . Let Py 1 be the schera P^of Theorem 5.4 
nidified by introducing a 'dynainic stop* : 

e^. gpto ^ 
at the old right-tennLnal address , so that a failure of 
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Py is a divergence of ?«'. Then ?„* = D if and only 
If U diverges from a blank tape- The leama follows 
fran 'Theorem A, and the proof of Lenra *l . / 



iz-rrra 6 . (i) P ^E, is partially solvable. 

Ui) P CJ) \ is not partially solvable, 
(even when restricted to schemata which 
converge under all recursive interpretations.) 

Proof. (1) We showed in S3-2 that the equivalence 
problem for schemata which converge under all 
interpretations is solvable, and if P = E^ , it is 
certainly such a schenn, so we already have an 
acoropriate partial procedure. 

(ii> If U halts fran a blank tape f P^ f E ± • 
If U diverges fran a blank tape, P r =r Ej- Ihe 



result follows frxxn Theorem A, _/ 



Lecrcra 7 . P (* 1 EL is not partially solvable. 

Proof . Let P^ be the schema P^ codified by inserting 
an ' unconditional transfer' to e, : 



f Q . goto e x 



^ 



as a new instruction, with prefix the old left-tennljial 
'U J 



aidresa of P m i.e. by 'counting success as failure 1 . 



Then: 

(i) if U halts Pran a blaric tape, Py" = Ej , 
(ii> if U diverges fron a blank tape, ? v n fr E^ 

since P n " c3n diverge unler ran* recursive interpretation. 

Theorem A again gives the result, _f 



Although the last two letmiata show that, for 
any reasonable relation * , if it is stronger than or 
weaker than recursive equivalence, P ^ E* is recursively 
unsolvable, there remains to prove; 

Lenrna 3 . P O E, is recursively unsolvable, even when 
restricted to schsniata which converge under all recursive 
interpretations. 

Proof , (due to D. M. Park) Let U be a universal Turing 
machine. Using the techniques of this section and $lJ, 
we can construct for each n, a schema P m which converts 
under every recursive interpretation arXi 

(i) if C[u,i&] *> Q , P m can succeed, 
(ii) if c[u,i^ c 1 , P m always fails. 

Suppose there is a reasonable relation *v t such that 

1 P *v EL * is solvable, then there is a recursive function 
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f such that: 

c|jJ*n3 • ■-? P can succeed 

-=? f(mj = D 
and: 

C[u,m]-1 ^ V*l 

This contradicts Lama 6, J*J,3. _/ 

Lgiara g, p {=} q is not partially solvable, even If 

P,Q are restricted to schemata which converge under 
All recursive interpretations. 

Proof . Let ?J* be the schera obtained from P a r by 
'counting success as failure 1 just as in the proof of 
Lanra 7- Then: 

(!) if U halts from a blank tape, P^ / V^f , 
(il) if U diverts, P^ s P^", 

and both these scheneta converge urtfer all recursive 
interpretations . — / 
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Iftis result differs sharply from the main result 
of 83*2, where we showed that an^ reasonable relation 
has a solvable decision problem when restricted to 
schenata which converge under all interpretations . 
We sunrarize the results of this section, (abbreviating 
1 partially solvable 1 to •p.Sp 1 }. 



Theorem 5-5 



r#i 



(A) P {; f } D is not p.s. ; P (J x ) DlBp.s.; 
P * D is true ; P / D is false , 

(B) P = E, is p.s.; P {**) E^ is not p.s. j 



1 sr ■ 



P tl* J E 1 is not p. s.; 



P *f E 1 is p.s.; 
P / E 1 is p.s. 



.= * 



(B 1 ) P {') EU Is recursively unsolvable- 

f Of course the same results hold with E* in place of E- .J 

(C) P {*J* Q is not p.s.; P (?) Qisnot p,s.; 

P Zf Q 1b p.s.; 

P / Q Is p.s. 
An asterisk signifies that the result is true even if 
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the relations are restricted to schemata which converge 
under all recursive interpretations. 

Related questions are settled by; 

Theorem 5»6 

(A) Can P diverge under any 

(i) interpretation f 
(ii) recursive interpretation, 
(ill) finite interpretation ? 

(B) Dms P converge urrier every 

( i ) interpretation , 
(ii) recursive interpretation, 
(lii) finite interpretation? 

(A) (i) is not p.3, <B) (1) is p.s. 

(ii) is not p.s. (U) is not p 

(ill) is p.s. (Hi) is not p. 

Prtof. Ihe results for (A) (i) artl (ii) ccrae frccn the 
proof of Lenma 7, end (iii) holds because there is an 
effective method of detenaining if a given schema 
diverges tanSer a given finite interpretation, and 
because the set of finite interpretations is countable. 
The result for (B) (i) follows at once frcrc 



there being an efifective procedure which produces, 
frcra any always-convergent scheira, en equivalent schema 
without loops. As for (B)(il) and <ili), coraider the 
schem P u r , which is P y r with a 'dynamic step' inserted 
at the old left-terminal address, so: 

If U diverges from a blank tape, P„ cannot diverge 
under any recursive interpretation, but ir U halts, 
then there is a finite interpretation urtfer which the 
simulation of U terminates and ?* diverges in the 
'dynanic step 1 . / 



55*3 'Adequate 1 rule-books. 

To prepare for our chief result on 'rule-books 1 
for siJiplification, we must prove a result as broad as 
that of Leans 9, for the case when Q is sone particularly 
simple, fixed, schem, Z say. Ncra of E Q> E. or D is 
a possible choice for Z. Of course if we are demartfing 
that Z be sijrple, we cannot hope to prove: 

P O* Z .... (X) 
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(i.e. with the asterisk) to be not partially solvable, 
because if Z does not converge under all recursive 
interpretations then (X) is always false, and if Z does 
then it must surely always converge and (X) is partially 
solvable . 

Let Z be the following schema: 

a. ^ :» FO^} 

TO^ e^ a 

which, for any I, fails unless the tape of I consists 
entirely of 1'a. Ftor any TUring machine U, we can 
construct a scheira 2y which starts: 

1^ :- PtLj) 
Lj^ :- FtLj) 

T{L 2 ) e x> *+l 
a. Lg :- F(L-,) 

T(L 2 ) *+!, a 



and succeeds precisely under those Interpretations with 
a tape of the fora: 

n C(0) n-1 C{1) rr-2 --•• n-r C(r) for scne n > 



DO 



where C<m) denotes tlie m confieiration of U started 
on a blank tape, C(r-l) Is terminal and r^n. If the 
conputation of U has not halted after (n-3) steps tbs 
schema fails. The only interpretations for which £.. 
can diverge are those with: 



I*(P\) 



1 for all m 



which correspords ton 3 » above. Provided n is finite, 
a 'tijre-UjaLt 1 will have been set and Z.. niust converge 
at least by the tine this is exceeded. If U halts 
fran a blank tape, then there is an interpretation 
vfaich provides a sufficiently large 'tiae-liinit 1 n, fop 
Zy to succeed, but for which Z still fails, so: 

Z y /Z 

Alternatively, if U does not halt: 

since any finite 'tiue-lijnit 1 will be exceeded. Thus w 
have proved: 

Theorem 5.7 There is a very siinple 2-instruction 
schema Z, such that, for any"reasonabXe relation ^ j 

P * Z is not partially solvable, 
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In S3, *\ we discussed 'rule-bocics* Tor transitive 
reasomble relations, as the possible basis of a practical 
salification algorithm. Let ^ by some fixed transitive 
reasonable relation. 

Ceflnition . A rule-book B for <\, is adequate (with 
respect to sane concept of 'sinplicity 1 )* if, for all P, 
if P ^ Q and Q is 'siller 1 than P, then Q is derivable 
Trent P by a sequence of rules of & 

fH): Iferpothesis (about 'sinpliclty' and <v ), 

If Z is the 2-lnstruction schema described above, 
then P ^ Z is partially solvable for all P which are at 
least as 'sijiple 1 as Z. 

Under this hypothesis, which is trivially true if, say, 
only finitely nany schemata are at least as 'sijiple 1 as 
Z, we have: 

Theorem 5-8 There is no adequate rule-book for t, . 

Proof * Suppose ffiis ai adequate rule-book for^ , then, 
for any P, if Z is sinpler than P, Z is derivable fran 
P by rules of B and so P ^ Z is established by a 
partial procedure which merely enumerates sequences of 
such rules, ara3 if Z is not sinmler, P <v Z is established 
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by the partial procedure of the hypothesis (H). This 
contradicts Theorem 5-7 and the theorem follows, / 

We describe briefly tow we can obtain a similar 
result to Theorem 5*7 with P restricted to schenata 
which converge under all finite (or recursive) inter- 
pretations* We recall the schema 7? of Theorem 5-l> 
which could only diverge for an Interpretation with a 
tape of the forta: 

12 3 15 6 .- 

By using T in place of Z as tte initial portion of Zy, 

P 
we obtain a schema Z« with the following properties: 

(i) Z» converges uraier all finite interpretation , 

(ii) if U diverges, Z y P = 7? 9 

(Hi) if U halts, Z» can succeed under an 

interpretation whose tape starts: 

12 ... nC(0) rniCU) n-r C(r) 

for scras n>r>0, and terminal C(r-l), so: 

7 F d 7 F 

hi ' Z 
p 

A similar construction produces Zy fraa U and R. Thus 
we have: 
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Theorem 5.9 Ihere are schemata Z , Z , such that, for 
all reasonable relations ^ ; 

P % T { P % Z respectively) 

is not partially solvable, even if P is restricted to 
schemata which converge under all finite (respectively 
recursive) interpretations, 

There is a 7-instruction schema which may be 
taken as Z , so the hypothesis corresponding to (H) is, 
in this case, still very acceptable, but in the case of 
& , the restriction seems rather stronger. Of course, 
under these hypotheses, we obtain results corresponding 
to Theorem 5.8. 

Just ore more feature of a construction in this 
section is worth pointing out. We define a trivial loop 
as one with no exits, e.g.: 

a. goto a 
and a unitary loop as one with Just one exit, e.g.: 

a. L :-F(L) 
T(L) a, b 
In 13.2 we found a decision procedure for schemata 
which always converge, arri we could extend this to 
schemata which either converge or enter a trivial loop. 



s* 



We might expect that in any class of scJemata with 
unsolvable decision problems, sane schemata should 
diverge In cctnplicated ways, however even for the class 
of schemata which converge or else diverge In same 
specified unitary loop, the equivalence problem is 
unsolvable. Furtheraore the unitaiy loop rcey be of 
the v«7 sijcple form of the example in the definition. 
We merely remark that the schemata Z^ or this section 
have this property. (This result can be strengthened 
slightly to show the unsolvability for schemata which 
diverge in such a unitary locp or fail *) 
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FAKT III, SOLVABLE TISEIBC. 

Since we are motivated by the desire to use 
program schemata as a tool in the simplification ard 
'proving' of canputer program, the results of the last 
section can only be regarded as negative, The remainder 
of this dissertation is devoted to the more positive 
results we have obtained, which hold out sccie hope of 
providing a basis for practical techniques, Fran now 
en we restrict ourselves to the decision problem for 
{strong) equivalence* In scim cases the corresponding 
results for other relations are readily obtained, but 
in others there are further difficulties. We describe 
results obtained in two main directions. 

A characteristic feature of the schemata with 
unsolvable decision problems that we have been consider- 
ing, is that irost expressions which are calculated are 
recalculated later on in an execution sequence* a feature 
which would be unusual and undesirable in an actual 
caiputer program. We shall investigate various restrictions 
which can be imposed on schemata to inhibit this repetitive 
behaviour* Firstly though, we shall exajnine schemata whose 
flow-diagrams have a particularly siirple structure. 
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A typical Tana for the 'simulation' scheireta we 
have been using is the following: 

A 




where we neglect transfers to terminal addresses, and 
where X 'checks an Initial ccflinguration 1 and Y 'checks 
that each configuration follows frcro the previous one 1 . 
Furthermore, by slnulating a universal Turing machine 
we can obtain a class of schemta with unsolvable decision 
prctolens by keeping Y fixed and varying X. We observe 
that X Is always loop-free, so we do not require arbitrarily 
ccnplicated loop-structures in an unsolvable class of 
schemata. Hcwever, taking the structure of Y into 
account, the typical flarf-diagram can be put in the fcm: 
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and we see that the loops In the schemata are 'nested 1 
to a considerable degree. The schemata we investigate 
in the next section are those with no 'nested' loops. 
So far only a decision procedure for such schemata with 
only monadic function synfrols, has been found, I 
conjecture that ttere Is also one for the general case* 
but there are obstacles to extending the present 
procedure. 
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§6. zazi'VCtK with laii-Ti-nERSEcrinG i£OP:; /txp 

M3MADIC FUHCTIQH SYI'BOLS. 

;6>E We express the execution of a finite sequence 
of computation instructions as the multiplying of the 
vector giving the values of the locations by a certain 
ratrlx, and shew that any path, whether finite or 
infinite. In a schema with non-intersecting loops can 
be carpletely specified by a vector of integers with a 
bounded nunfcer of components. The value of a location 
after the execution of such a path proves to be 
expressible by a * linear wonl function 1 and a formula 
of the additive theory of the natural nuntoers with tte 
caipcnents of tte patl>-vector as free variables, tence 
we find a formula which expresses the equivalence of 
two given schemata, anl as the truth of formulae of this 
theory is decidable £ 5j, the equivalence problem is 
recursively solvable. 
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g.l Preliminaries . 

We define Sas the class of schenata satisfying 
tlie restrictions: 

(I) No cccputation instruction is In rcore than 
one loop. 
(II) The programing language contains crcly 

function ayntools which take one argument. 
We shall assume that ttere is just a single test synfcol T, 
but» in distinction frccn the above two conditions, this 
assumption involves no real loss of generality. For 
suppose we had the test function T' t then we merely 
introduce the new location L 1 and new function synbol P 1 , 
and replace each: 

T(L) b, c 
by; 

L' := P' (L) 
Ttt 1 ) b, c 

This is an equivalence-preserving transformation and 
one which also preserves conditions (I) and (II) above. 

Let P be a schema in S. We can assume that there 
is sore computation instruction in each loop, because 
there are obvious ways of producing a schema equivalent 
to P and having this property if P does rot have it. 
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end we select one such as a base-point In each loop. 
Let their addresses be b^b.^.^b _, which are distinct 
by condition (I)* 

A path is a sequence throu^i a schema which does not 
necessarily start at the initial address ror reach a 
terminal address. 



1. If there is a path fron bj to b. (i^j) tten 



there is not also a path fron b. to b,. 

Proof , Ctovious from condition (I), '_/ 

Pot any finite path p in P, we define, for each 
1 = l»+**»q, the i loop-coefficient » y,, to be ore 
less than the nuntoer of occurences of b ± in p if 
positive, and zero otherwise, Leirata 1 assures us that the 
occurrences of different base-points do not interlace 
each other. We now obtain the reduced path, p 1 , by 
cutting out, for each positive loc^coefficient, that 
part of p which follows the corresponding base-point 
up to and including its final occurrence, Tne reduced 
path cannot contain any loops irore than once, and so 
the nunber of possible reduced paths in any schema is 
finite. Let us call them p if ...,p for P. Ang finite 
path path p, in P can be specified coipletely by the 
reduced path p 1 - p. say, ard the loop-coefficients 

*1-" V 
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Suppose the function syirfrols F, f ...F arrf 
location syntools Li *•••*!*■, &?& ^ o* 11 * langua©s. Let 
Z be the free semigroup with generators F^ i-l,..p t 
and L-, j°l»..,ni and zero and identity elerents (D» 1) 
adjoined in the natural way. We use concatenation to 
denote the semigroup operation- 

Definitions * An tn>-) transform is an {m x m)-raatrix 
over Z with exactly one non-aero elenent in each row. 
toiltipllcation of transforms is denoted by concatenation 
and is defined by: 

(AB)— ■ K^ts wtiere c ** ^ u^QU* iJitsger 
such that A_ 7* 

Multiplication is associative and the prcduct or two 
transforms is a transform. It is convenient to 
abbreviate the instruction: 

h : - w 

by: ' " ■ 

M ^ T i1k'' tte wansforro corresponding to the instruction 
T. , w is define as A where: 



A^ = 1 ir r ji I 
A ■ otherwise. 
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has the foxm: 



i*a 












1 







jf 



... P. .. 

1 ' k 

1 



u 





1 

1 



A aegpenC la a sequence of conputaticn instructions. 
We define H{ ° ) » the transform corresponding to the 
segjrent a, inductively as follows: 

M(«) = I, the identity irstrix, if - is null, 

= H(T Uk )M(6) if * consists of a followed 
by th instruction T. ,. . 

Lepra 2 . If the columi vectors X ■ (x- , * . . f x ) and 
X = (y-i»***»y ) denote the values of the locations 
Lw.-.^Il in a free conputaticn sequence, respectively 
before an3 after the execution of the segjnent *, then: 

Y = W(«)X 
Proof* Qy inspection, are: induction on the length of *. / 



A word function is a function with integers as argjrents 
whose values are elements of Z. 
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A linear word function flwf ) Is a word function of the 
form: 

where U. cZ for 1=1,., >k, and m. Is a r>Dn-negatlve 
integer, 

u i tU iV^ u i 

■ 1 if n^ = 



16 . 2 P-representat Ions . 

Let ff be the first order theory of the natural 
nunfiers (non-negative integers) with addition as the 
only operation. Hilbert and Bemays ([5], vol.1, pp 359-366) 
have given a decision procedure for the truth of fonnulae 
in this theory. (Presbureer \l2] proved the elenentary 
theory of the integers to be decidable sonewhat earlier,) 
We shall call a fonnula of P a Presburger f ormda or 
P-foraula . A P-fonrula ^<t.,..,t ;n,,..,n ), with the 
free variables as indicated, will be called a P-fUnctlon 
if, for all natural nunfcers n^..,^, there exist unique 
natural maters tj,*.»t r such that ift^..,^;^,..,^) 
is true. This condition can be succintly expressed as: 
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(n)(E!t)f *(t;n) _] 

where the quantifier ' EI ', which states unique existence, 
is readily definable in terms of the equality relation 
and E, the standard existential quantifier. We shall 
often find It convenient to express netatheoretical 
statements in the formalism of the predicate calculus. 

A word function Gfr^,..,^) Is P-representable if there 
is a Iwf w, and a P-function $, such that: 

(t)(n)f *(t;n) =^ G(n> = W(t) J 

l£nnBj.. ir G^m), G 2 (n) are P-representable then so 
are: 

{i> G 3 (m,n) = G 1 (m)0 2 (n) 

<li) G 1) (n,n,r) - O^m) if sir) 

- G 2 (n) if -,6<r) 

where 6 is a P-fomula 
Prcof. Si^pose: 

(s,3i)[ i^sim) => G^m) = V^s) ] 

a^ (t,n)[ fl ? (t;n) ^> G 2 (n) = W 2 (t> J 
then: 

(1) <s,t,m,n)[ $, (sjdi) A * 2 (t;n) -=^. 
G 3 <ni,n) - W 1 tgflU& ] 



:■■:■ 



(11) (s,t,m,n,r)[ i(r) a ^(sjmJA t-0 .V, 
-fi(r) a ^(tjn) A s=0 : =i 



G y (m.n,r> - W-^sJW^t) j 



leinra \ . If A is an (id-) transform, there exist h,k > 
with h+k < frf", such that: 

A h+k • A*b where D is a diagonal matrix. 



Proof . We can find B,D 1 where B is a transform with 
elements either op 1, and D, is a diagonal matrix, 
such that: 

A ■ D-jB 
S im ilarly there is a diagonal matrix D ? such that: 

BE^ - DjB 
and we define D^. for each n > 1, by equations: 

ro n " Vl B 
and denote D 1 D 2 -...D r by DJ. . For each r, B** is a 
transform with only l's as its non-zero elements. There 
are at most m m such transforms, so not all of I,B,B?,.. f B^ ' 
are distinct. So ff** = b" say, for some h,k with 
h+lc < m™. Then: 

A b+k = <D 1 3) h+k 

■ (D 1 B) h <D 1 B) k 
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A--* 



D k B 



D/EB h+k where B h D£ ■ EB" 



"h™ 



v"\ 



ahg 



But IV is a diagonal matrix* / 

Let a be a segment, and let G ^{n) be the word- 
function whose value for each n is the expression 

associated with Li after n iterations of a, starting 
from the initial expressions L * L^, ••-,L m - 



Lemma 5- G . (n) is P-reoresentable. 



Proof . By Lemma 4, there is a diagonal matrix 

D = r^'"-''^]' such that: 

(M(a)) h+k = {H(ff)) h D for some h,X with h+k < m 



,n» 



Therefore we can find words in Z t say V^, "-' v v -1* W Q' fc- 

w^ ir and integers j x ^h-1' i l ' ' 'Sc-l of the 5et 

|i, , - - ,ro/ so that: 
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G fl(i (n) - L t if n ■ 



VjLj if n = 1 



v. ,L. if n = h-1 

3h-l 



Vl J 



WQdj_ r L, if n = k.r + h for some r > 

0*0 — 

w l d i rL i if n = Jc.r + h + 1 " 

1 1 



w k~l d i, r L i if n - X.r + h + (k-1) " 

K—X K— X 

For fixed k,h,s, we express ' n = k.r + h + s for some r > 
as: 

{Er)Tn » r+r+.,,,+r + h + s] 
^ -He times-*- J 

which is a P- formula. Using Lemma 3(ii) repeatedly, we can 

construct a P- function and a lwf which P-represent Q (n) . 

a*i 

Given P-representations of G . for 1=1,. .^nu and of 
G fl,i' we P roduce a P-representation for the word-function 
G ~ i( r ^ s ) of two arguments which gives the value of hi 
after the sequence ' * r B s \ by "substituting 1 the lwf's of 



G fl j for the appropriate Lj's in the lwf of G. 



,x 
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For a particular location L^ and reduced path p in a 
schema Q, the word-function G , (71 , . . . , 7 ) , which 
gives the value of location Li after the execution of 
the path p, with reduced path p and loop-cooef icients 
\*"*>\* if such a P exi *ts (and has the value 
otherwise), is P-representable. This result is readily 
obtainable from Lemmata 3 and 5, and the last remark. 

Now let h - (Uflfl^ *t q ) be a (q+2)-vector, 

which is intended to specify both a path in Q and a 
location l^. We define the word-function: 

Q(h) = S u jfr, 7 q ) if this is defined, 

= otherwise. 

Obviously is P-representable too. 

We shall be interested in both finite and infinite 
paths through schemata, in schemata of the class S, any 
infinite path must ultimately remain in a single loop. 
We represent such a path by its finite initial segment 
which terminates at the first occurrence of the correspond- 
ing base-point. We adopt the convention that the loop- 
coefficient for this ultimate loop shall be 0. For any 
schema Q, let: 
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(i) fin (h) be the condition that h represents a 

path in Q from the initial address to 

the terminal address e - 
(ii) fin - (M be the corresponding condition with e*, 
{iii) infQ(h) be the condition that h represents an 

infinite sequence from the initial node 

of Q, under the convention given above, 
(iv) sub (k,h) be the condition that k represents a 

path in Q which is an initial segment of 

the path represented by h, 
(v) T Q Q (k,h) be the condition that subQ(k,h) and that 

the path k is finite and laads to a 

transfer instruction at which the location 
corresponding to k is tested, and which 
requires that the test— function takes the 
value in order that the path h be 
continued, and 
{vi) T^ QS'^ ^e the corresponding condition with 1 in 

place of 0* 

We notice that , £in °(h) ( f 'fin^th)', and 
'inf (h) ' just take the value 'true' for a certain 
finite range of values of u (=h 1 ) under scene corresponding 
conditions on the loop-coefficients. 'sub**' is a 
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straightforward predicate involving only a finite 
number of possibilities for h, and ki, some equalities 
and one inequality between some loop-coef f icients ■ 
•T n and *T^ f involve 'sub^', but are otherwise just 
enumerations of a finite number of possibilities. I 
claim that 'fin Q ' , 'fin^ 1 , 'sub 0, , 'T^' and '* Ql 
are predicates definable in P, and there is an effective 
construction of them from a given schema Q, Detailed 
proofs of these claims would, however, be cumbersome, 
tedious to read and, surely, superfluous. 

We now require a method of constructing a 
P-formula from two given Iwf's, specifying the precise 
conditions on their arguments, that they take the same 
value- This is given by Theorem 6.1 below. First we 
need to prove: 

Lemma 6 > Let U,V be words in Z, and u • lengthfU), 
v ■ length(V). suppose that v^u and n is an integer 
such that n >(v/u)+l. If: 

U n X - WY for some words X,Y 
then there is a word W and integers r,s (>1) such that: 

= W c and V = W s . 
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proof. Assuming that \j ft V for any r, we can suppose 
that: 

V z Z-, , *z> ■ V for some r>0, and some t< u, 

where U = z ± z 2' * mZ u an ^ the z's are generators. 
Prom the hypothesis, we have; 



' .l z t+ 2"* z u z l — * t " *lV z u- 



t+: 



*i ■ z [ i+ t] - * [i+2t ] «*♦••- ■ ti+IH t] for a11 

m > 0, where hi 5 j (mod u) and < Pjl ^ u, for i^u. 
Let h ■ hcf(u,v) m hcf {u,t), then: 

z i = z i+mh f0 *l£i<h and m- 0,1 (u/h) - 1 

Therefore if w denotes 'z^e-..^' : 

U «= IT and V ■ W s where rh * u and sh - v , _/ 

$6.3 Decision Procedure 

Theorem 6-1 If H and K are lwf 's, there is a P-formula 

*H,K' which is effectively obtainable from H and K, 
such that: 

for all h,Jc, H(h) = K(k) if and only if 
% K ( -'£ J is true - 
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Proof - Let H(h) = ujl U^2 xjhra , K (k) = v Jl v *2 .„ vft", 

The result is trivial if m=0 or n=0. Suppose that the 
result holds for all pairs with m+n < t, where t ^ 1* 
We show that we can construct a p- formula *„ v for the 
case when m+n = t, then the result follows by induction. 

We write 'U < V* for words U f V when U 
properly divides V on the left, i.e. when UW * V for 
some non-null W. 

Case (i) U x = V x 

^H k'— '— ) ^* s ©bivalent to: 

(Eh) U^Uj 2 - - . ujjm - V2 2 - . . vjjn fs h x - h+1^ 1 , V . 

»> [ °5 2 ■ • • «** - *£ • • - *• ^ *i ■ * +h l] 
and the induction is complete. 

Case (il) u 1 < v 1 and Vj^ f u 1 and D 1 ji V^ 



[ 



*„ „(h,k) is equivalent to: 

o5 2 --- u hfn = v^ 1 . . . vj;n A h, - 

2 ml n 1 



. V . 



k. 



Tu^l . . . U h * = v£ 2 . . . v* n - k, = 

1 m * n 1 



Case fiii) U, < V, and similarly for V* < U. 
Applying Lemma 6 ( we see that there are two 
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alternatives: 

either (a) U - W r , Vj = W 6 for some word W and 

integers r,s, when # (h,fc} is equivalent to: 



V n ^ i + s.k, = r.h, . V - 

n 11 



(Ei)[w i u5 2 ... U^n . v * 2 ... 

(Ej)fu 2 2 ... ujj* 1 ■ W^V 2 ... V kn ^ j + r.h x = s.k 



where 'r.h, ' denotes 'h, + hi + .... + hj f , 

— r tunes ~ 



9E (b) there is no such w, and h, < r+1 or k* < 2, 

where r - [length (Vi) 1 In chls case * „ can be 

[length (U x ) J H '* 

written as the disjunction o£ the r+4 possibilities 

corresponding to h = 0.1, ,r+l, or k x = 0,1. In 

each of these the initial factor of either H or K can 

be replaced by a fixed word, so we require a P- formula 

to represent an equality of the form: 



wu 2 . . . U^ - V, i 



. .. u* 1 * - v, 1 ... V* 11 or similarly with W 

m n J 

, then we 



the right hand side. Let r - f l" 1 ?"* j w > 

[length (V i )J 

distinguish two further subcases; 

■ 
(b x ) if W^ vj +1 , then k x = 0,1 



or r - 



Each of these possibilities gives rise to an equation 
of the form: 
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wu!; 2 ... o^ = wv* 2 ... v*n 

* m -L n 

and then either w = WW" and the equation is 
representable by: 

wV 2 ... u h m = v5 2 ... v*n^ h = 1 l 
* m * n J 

or else W ^ W and the equation is always false 
(and so representable by , 0-l , f say). In either 
case the induction is complete, 

(b 2 ) If WY = V x r+1 and V 1 - XY for some X and Y, 
then we have the possibilities of (b,) or else 
k l £ **1* The latter case reduces to the equation: 

u 2 2 "" *£* = "5 ••• V n" where k+r+1 = fc i- 

We now repeat the whole process on this equation, 
taken in the form: 

If cases (i), (ii) or (iii) (a) occur, the induction 
is completed, in case (iii) (b) , we obtain equations 
of the form: 



U 2 * ' - U m = WYV 2 2 • • • V^ 1 



Of! WU h3 ... uj* = Y(xy) k V^2 ... v^n 
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both of which can be dealt with as in case (iii) (b.) . 

For any schema and path/location vector for Q, 
h, we defined Q{h) to be the word- function giving the 
value of the location corresponding to h after the 
execution of the path h, and we showed that Q(h) was 
P-representable. so suppose 3 ia a P-function, Q* ia 
a Iwf, and: 

<2'>i>l> Q <8;h) => Q(h) - Q*(s)] 

For two schemata Q,R, the condition diff Q R (h, k) on 
h and k that Q(h) t R{k) is the P-formula: 

where t^ R# is the predicate of Theorem 6.1 for the 
lwf 's o* and R*. 

Theorem 6.2 For any two schemata 0,R, there is a 
P-formula, *q^ r , free in the variables h,k, such that: 

y Q r&'2S) is true if and only if the paths 
corresponding to h and k in the schemata Q and R are 
consistent, and represent execution sequences. 
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Proof. t_ (h,k) is the formula: 
Q,R 



T?(h ,h}^T° 



^ .iQl^hJ .->. diff^th^) 



j <h '£l'W 



* s\ 



: ^(h^h) a tJ^^) -=>- diff^Ch^) j 



. : ^ : . f in Q (h) V »&. Q <h) V inf° <h ) 



.:^ : . fin Q R {k)^ fi*^ Qe)v £sf*tt) 



Theorem 6.3 Given schemata Q,R, in S, there is an effective 
construction of a formula Eq(Q,R) of P which is true if 
and only if are R are equivalent. 

Proof . Eq (Q,R) is the formula: 



(h,k) 



. v . f inj (h) * f inj* {k ) 
.%•. inf Q <h) ^ inf R (k) 



The theory P is decidable 



[,]. - 



we have proved: 
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Theorem 6.4 The equivalence problem for schemata in 
the class S is recursively solvable. 
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$7. Free, liberal and progressive schemata 

j7. 1 Freeing liberal schemata 

Definitions . A schema P is free is every sequence through 
p is an ececution sequence, 

A schema P is liberal if, in every sequence through P, no 
expression (under a free interpretation) is computed 
more than once. 

Theorem 7-1 Given any liberal schema p, there is an 
effective construction of an equivalent free schema p . 
(This construction will be called 'freeing P',) 

Proof . We have only to ensure that no location P is 

tested more than once by the same test-function between 

successive assignments to it, for then, any sequence is 

necessarily consistent and therefore an execution 

sequence. The construction of P is by stages. At each 

stage, for some u,j, we 'free the schema with respect to 

T (L ) ' , in the following way. 
u j 

Three copies of P are made, P ,P°,P , say. The 
addresses ccrre^nnding to each *a ' in p will be denoted 
by a ,a ,a respectively and the transfer addresses in 
each copy altered accordingly. Any test in P of the form: 
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IS 


replaced 


in pN 


by: 
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(L.) 
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b ., 
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go 


to b 1 


} 





and in P^ by: 

go to c . 
Any computation instruction in P of the form: 

a • Ija ; = - - * # * 
is replaced in both P and p by; 

go to a H . 
The initial address o£ the new schema is a 0# where a n is 
the initial address of P. Since the part of the new 
schema corresponding to P is only accessible when the 
expression in location L^ has already been tested by T u 
and T u (Lj) * 0, and similarly for P 1 , it is clear that 
the new schema is equivalent to P. Furthermore, in the 
new schema L* can be tested at most once by T u between 
assignments. 

When we have successfully freed P with respect 
to each pair of location and test symbol, the result is 
the new equivalent free schema P*\ 

The application of this construction to a simple 
schema is illustrated in flow-diagrammatic form in Figs- 
(i), (ii) and (iii) on the following pages. (Note that 
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Fig, 1- 

Schema P 




£ 



Pig . 2 • Schema P' 
? treeci w.r .t . T(L) 
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F ig . 3 - Schema P" 

' P' freed w.r.t » T(Mj 





N:* 



h{l,n) 



A. 



*Q 



N-- F(L.M)J) 




Fig . 4 . Abbreviated form of P" . 

1. L :» G(N) *,2 

2. N := H(L,N) 3,e Q 

3. K :- F<L,M,N) 3,© Q 
t. goto "■ 
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for the sake of clarity various 'inaccessible' parts of 
the schemata have been omitted.) 

Without losing any generality, we may assume 
that in any schema, no location is tested before it has 
been assigned to, because given any schema P wo can, if 
necessary, introduce a new monadic operator K, say, and 
precede the schema by the new instructions; 
L x s- K(L X ) 
L 2 := K(L 2 ) 



etc. 
to produce the schema P*. Clearly, 

P 5 Q if and only if P' - Q' 
and if N(P') is a canonical form for P' r then a canonical 
form for P is produced by just deleting from NiP 1 ) all 
those instructions involving K. 

Suppose we take any schema satisfying this condition, 
and, for each location L* insert after each computation 
instruction assigning to L the trivial sequence of test 
instructions, 

T 2 (L) '+1, '+1 

* 
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where T ,...,T are all the test symbols. The resulting 
1 k 

schema is equivalent to the original one and we note 
that, after applying the freeing construction described 
in Theorem 1, the new schema still has the property 
that each location is tested in turn by each test symbol 
after each assignment to it. Following this sequence of 
tests, the next instruction may be one of up to 2* 
successors- For our purposes it is a convenient abbrevi- 
ation to express the computation and succeeding tests 
in a single statement as, say: 

L 3 : - F s (L 3l V *1'*2 V 

so the next instruction has address a where 

- - i '*2J *"*-w 

We are therefore considering schemata whose statements 
are all either of this form or else are unconditional 
transfers. The latter, however, are clearly eliminable, 
provided that we allow just one new instruction of the 
form; 

f . goto * 
which provides an equivalent for any loop consisting 
entirely of unconditional transfers. 

At this stage it is convenient to make some 
further simplifications* The initial address is marked. 
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All successors of marked addresses are marked and this 
step is repeated until no further addresses can be marked. 
The marked addresses are then precisely those which are 
accessible by a sequence (and so an execution sequence) 
through the schema. All other instructions can be 
deleted, without loss of equivalence. In a similar way, 
though proceeding in the reverse direction, we can find 
all addresses from which either of the terminal addresses 
is accessible. All other instructions can be deleted and 
transfers to them replaced by transfers to *. Finally, 
if all the successors of an instruction are e {e. respect- 
ively) then the instruction can be deleted and transfers 
to it replaced by transfers directly to e {e^ respectively). 
In some trivial cases, sensible alternatives to these 
simplifications are required. 

As a result of these considerations we state, 
without further proof: 

Theorem 7.2 There is an effective method whereby, 
from any liberal schema, an equivalent liberal schema 
may be constructed with the following properties; 
(i) it is free 
fii) it is expressible in the abbreviated form given above, 
(iii) any (partial) sequence in it which has not reached 

*, may be extended to a terminating execution sequence, 
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{iv) any (partial) sequence in it which has not 

reached e (e respectively) may be extended 
to an execution sequence which never reaches 
€q (e, respectively) - 

It seems likely that the decision problem for the 
equivalence of free schemata is solvable, and certainly 
none of the techniques we have used so far to prove 
unsolvability is at all applicable. Theorem 7.1 shows 
that, for all practical purposes, any liberal schema can 
be considered to be free, but the reverse is not the 
case. For example, the free schema: 



L 2 *- F(L X ) 


L x r= Tihj) 


L x := r{L ) 


L 2 j= F(L 2 ) 


TtLi) *+l, a 


L 2 := G{L 2 ) 


T(L 2 ) e Q . ei 



can be readily shown not to be equivalent to any liberal 
schema. The difference between the two classes is apparent 
too in the next two theorems* 
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Theorem 7 . 3 The property of being free is not recursively 
solvable ■ 



In the proof of this Theorem we shall require a 
result of E.L. Post [ill- The 'correspondence problem' 
is the following; 



Given a set of pairs of words {<U.,V.^|i = X 4 



over a finite alphabet, to decide whether there 
exists a sequence of indices i, # i ? ,...i, (k > 1) 
where 1 < i. <? n, such that; 

*I *2 \ X l *2 ** 

Post proved that the correspondence problem (for an 
alphabet with more than one letter) is recursively solvable, 



Proof of Theorem 7.3. For any word W ■ F-F,»-*F L over an 
w a b n 



i 



; p ' F i""« p m i ' let: 



w 



U)i 



J 



ana: 



=t 



w 



[2) 



J 



denote the sequence 
of instructions: 



i 



L X :- P b { Ll ) 



L l I= V L 1> 



the corresponding sequence 



with L 2 in place of L.. 



T 
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denotes 




Consider the schema* 



i__ 







u!" 






1 




\<« 




Vj-1 







u. 1 .? 








ui" 






M 




W> 




This is evidently free if and only if the correspondence 
problem I <V± V-> t . . . , < U n ,V >| has no solution. / 
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Theorem 7.4 The property of being liberal is recursively 

solvable, 

proof . For any sequence in which the same expression is 
computed twice, we consider the first such occurrence. 
We observe that, for this to happen, the two relevant 
computation instructions must be identical up to the 
choice of assignment location, and none of the retrieval 
locations is assigned to between them. Let us call this 
an immediate repetition . A schema is therefore illiberal 
(i.e. not liberal) if and only if there is an immediate 
repetition in some execution sequence. We can apply the 
construction of Theorem 7.2 to any schema, and produce an 
equivalent schema which is free if the original is liberal, 
but is illiberal if the original is illiberal (because 
corresponding sequences of computation instructions are 
the same) . The decision procedure is merely to look for 
basic repetitions in any sequence through this schema. If 
there is such an immediate repetition, then there must be 
one where the number of intervening instructions is less than 
the total number in the schema, therefore the search can 
obviously be made finite. If no immediate repetition is 
found, then, if the schema is free it has been found in an 
execution sequence, and if not this can only be because the 
original schema was illiberal. / 
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>7*2 Progressive schemata 



Definition . A schema P is progressive if, in any 
sequence through P, the assignment location of any 
computation instruction is taken as a retrieval location 
by the next computation instruction, if any. 

The 'expression computed by an instruction' in an 
execution sequence will mean the string which is produced 
under the corresponding free interpretation. 

(i) The lengths of successive expressions computed by a 
sequence through a progressive schema are strictly increasing 
(ii) Given any expression E computable by a sequence through 

a progressive schema, there is a unique integer n, and a 

unique sequence s r of expressions 
S = <E , .. . ,E =E*> 

such that any sequence of a progressive schema which computes 

E computes precisely the expressions of s in turn. 

Proof. 



(i) Each expression must contain the previous one as a 
proper subexpression, 
(ii) E is uniquely decomposable into its immediate 
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subexpressions and either each is an initial symbol, i.e. an 
L., or else one of the subexpressions was the expression 
computed immediately before E and so is distinguished by 
being the longest, in view of (i) . The result is then proved 
easily by induction on the length of a sequence which computes 
E« / 



Corollary of Lemma l{i) . Any progressive schema is liberal. 

Definition - Let depth (E) be the number n of Lemma 1 for 
suitable E (and undefined otherwise} ■ 

Lemma 2 . Let t 1 ,t be sequences through the progressive schema 
P, which compute E^, E 2 respectively, and suppose that 

depth (E,) - depth (E ) 
but E. ^ E ? . Let tj\t ' be continuations of t #t~ 
respectively, which compute E, ' and E *, say- Then Ei ' ^ E 2 ' • 

Proof . Trivial from Lemma l(ii). / 

Suppose P ( Q are free, liberal, progressive schemata 
with the properties described in Theorem 7.2. 

Theorem 7.5 If P ; Q, then for all interpretations I, 
the execution sequences ff j and * ^' compute precisely 

the same sequence of expressions. 
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Proof. Suppose * only computes n expressions in the 

sequence tj, which then terminates or reaches the address *, 
while ff T computes an (n+l)th expression in the sequence 

to- By Theorem 7.2, there is a continuation t 2 ' of t ? 
which behaves differently from t , and clearly t* and t2* 
are consistent. So P ^ Q. 

Alternatively, suppose the (n+l)th expressions calcu- 
lated by * t and * * Q * exist and are different, and 
denote the sequence thus far by t and t respectively. 
There is some continuation, t, ' say, of t- wnich terminates 
at e(-e or e^) • There is also some continuation t 2 ' of 

t which never reaches e. Since t. and t are consistent 

2 1 2 

and, by Lemma 2, the sets of expressions computed in the 

continuation parts of t ' and t' are disjoint, it follows 

that t ■ and t ■ are consistent, and so P ^ Q. These 
1 * 

contradictions prove the theorem, / 



In a 57.4 we shall produce an algorithm to decide the 
equivalence of any progressive schemata, however it seems 
desirable to present first the easier decision procedure for 
progressive schemata of a more restricted class. 

57,3 Full schemata 

Definition* A full (or fully progressive) schema is one 



in which each computation instruction takes every location 
as a retrieval location. 

Corollary . A full schema is progressive. 

Lemma 3 . Tor any two sequences in full schemata which 
compute the same sequence of expressions, the two 
corresponding sequences of computation instructions are 
identical oxcept perhaps for the last instructions of 
the sequence, whose assignment locations may differ. 

Proof . Assume that the result is true for sequences 
which compute just r expressions and further that for 
such sequences the two corresponding free computation 
sequences are identical except perhaps after the last 
ire tructions. This hypothesis is trivially true for 
r = 1. Suppose it to be true for r < k (k > 1) , and 
suppose two sequences compute identical sequences of 
expressions of length k. The elements of each member of 
the free computation sequence are clearly distinct and 
each occurs as a subexpression of the next computed expres- 
sion, so the penultimate computations must replace the 
same expressions. Hence the penultimate assignment 
locations are the same and therefore so are the next 
members of the free computation sequence. The last com- 
putation instructions must therefore agree to within 
choice of assignment location and so the hypothesis holds 
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for r = k. The result follows by induction, / 

Now we define an equivalence relation - , on the 
statement addresses of schemata in the abbreviated form 
described in Theorem 7.2* 

Definitions . If address a has the successor addresses 
A X* * ** ,a yk i then we define: 

succ^ (a) = ^ i for i = 1 2 k 



The instruction associated with the address a is 
denoted by inst la) ■ 

Definition of ^ . 

Clause f i) , 

a - b if and only if inst (a) = inst(b), and for each 

i = 1 2 k , succ i (a) - succ i (b) 

(or succ i (a) - succ^fb) if either on* is 

e 0' e l or T '* 
Clause (ii> . 

- is the weakest relation satisfying Clause (i) . 

Lemma 4 . The definition of - is welldefined and effective. 

Proof. For certain pairs a,b, Clause (i) gives a / b. 
This set of pairs can be effectively constructed by 
stages. At stage n (n > 1), we add just those pairs 



whose inequivalence follows from clause (i) and, if 
n > 1, the pairs added at the previous stage. When 
this construction terminates, wo can clearly define all 
the remaining pairs to be equivalent, without obtaining 
a contradiction from Clause (i) . The relation so defined 
also satisfied Clause (ii) . / 

Lerrana 5 * — (and indeed any relation satisfying Clause 
(i>) has the 'substitution property 1 , that is, 

'Given any schema P, with addresses a,b where 
a — b, the schema P', obtained by replacing an 
occurrence of 'a 1 in a transfer address by 'b', 
is equivalent to P." 

Proof . Under any interpretation, the execution sequences 
in P,P* can be shown inductively to have corresponding 
addresses equivalent and so containing the same instructions, 
Hence the values of the two sequences are the same. / 

Construction of the 'canonical form 1 P . 

Given any full schema p, we can effectively construct 
an equivalent schema satisfying the conditions of 
Theorem 7.2. Assuming now the abbreviated form, we 
replace the assignment location in any statement all of 
whose successor addresses are r , e n or e l ^Y some fixed 
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location, L^ say. This produces, of course, an equivalent 
schema. Now we select one from each —equivalence class 
of addresses and replace each transfer address by the 
chosen address of its class. Repeated application of 
Lemma 5 shows that the resulting schema is equivalent to 
the original one- With the representative from the class 
containing the old initial address as the new initial 
address, all except the chosen representatives are inac- 
cessible and may be deleted, leaving a set of addresses 
no two of which are — equivalent. This schema with a 
suitable renaming of the addresses is P. One method of 
naming which suggests itself is to label the initial 
statement with '1', and thon proceed by using the lowest 
integer not yet used, to label the earliest successor of 
the lowest-labelled statement that is not yet labelled. 
By Theorem 7.2, each statement is accessible and so gets 
labelled eventually. 

We require to prove that for any equivalence class 
of schemata the canonical fcrm obtained as described is 
unique. 

Lemma 6 . if p= 0, then corresponding statement addresses 
of any consistent pair of execution sequences through P 
and Q are *- -equivalent. 
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proof. Suppose P = Q, and that x ■ x^au, .*■ and 

y = Y^y-*--- are such a pair of sequences and that 

x r - y r for r < n, but * n / Y n * 

Then, by the proof of Lemma 4 r there is a consistent 

pair x',y ' of extensions of x , .. , ,x and y , . . .y 

respectively, such that for some m > n, either 

inst(x l m ) j£ inst (y* ), or one of x' ,y' is e r e or 

m m m 1 

* and x* ^ y' . Prom Theorem 7.5, Lemma 3 ( and the 

m J m 

consideration that the exceptional case of Lemma 3 has 
been explicitly prevented, the first case is impossible. 
By Theorem 7.5, x' and y' reach one of e^e,,* simul- 
taneously and so the second case is a contradiction of 
PSO. / 

Theorem 7.6 . If p,Q have canonical forms ^,2* as des- 
cribed above then: 

P = Q if and only if p and $" are identical. 

Proof . Since a schema and its canonical form are equi- 
valent: 

P,Q identical implies P s Q. 
For the reverse implication, if P s Q, then by Lemma 6, 
the initial statements of P,Q f and hence of S^, are 
*■ -equivalent. 3y the definition of ^ , corresponding 
successors of equivalent addresses are equivalent, and. 
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by the construction of the canonical form, there is 
only one representative of each equivalence class in 
each schema. Therefore P,Q are the same to within, 
possibly, renaming of addresses, but both were named in 
a standard way which depended only on the structure, so 
p,Q are identical- / 



Hence we have: 

Theorem 7,7 , The problem of deciding whether or not 

two given full schemata are equivalent is recursively 

solvable. 

Corollary , The equi\elence problem for schemata which 
use only one location is recursively solvable. 
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j7.4 Decision procedure for progressive schemata . 

We can assume at once that any progressive schema 
is already in the abbreviated form guaranteed by 
Theorem 7.2. For any statement: 

a. Lj :- F^d^ L k ) * 1 » ) 

we define t 

f "t a > = Fu • 

the assignment index of a , asg(a) = j, 

the retrieval vector of a . Ret {a) ■ < ki i - . . ,k > 
and denote the i component of Ret (a) by ret 4 (a) ■ 
A location L . is alive at address a if there is a sequence 
through the schema from a: 

a = b ,b 1/ b 2 , •••#b |] (n > 0) 
such thati 

{i) jG Ret(b n ) 
but (ii) j ^ asgfb,) for any i, £ i < n. 
{No location is alive at a terminal address.) If there 
is such a sequence, then there is clearly one of length 
less than or equal to the number of statements in the 
schema, so there are effective and efficient algorithms 
for determining the sets 

Live (a) M t j I L. alive at a 



def * ' J 
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For any point r (n) of a sequence x through any 
schema, we define the free configuration , C( * ,n) , as 
the (m+1) -vector: 

C( *,n) - < »r{n) # A(')(n) > 
and the reduced configuration , C ' ( * t n) , 
C'( w ,n) = <*(n), A'( ir ) (n)> 
where A'{r)(n){j) *A(»)(n)(j) if L. is alive at ir(n), 

■ otherwise. 
Also Range U,n) is the set of all non-zero elements of 
A'(»)(n) : (which are necessarily all distinct). 

It is evident that, for any interpretation and 
for any point of the corresponding execution sequence, 
the continuation and value of the sequence depend only 
on the reduced configuration at that point. 



LenaM 



7« P = Q implies that, for every interpretation 



I and integer n: 

* 

Range( ^W^j = Range{ * j* * ,n) . 

Proof , Suppose *,*' are consistent sequences in P,Q 
respectively and some expression E is in Range (*,n) 
but not in Range (*',n)* There is a sequence continuing 
from Mn), in which £ is taken as an argument in the 
computation of some expression £*, say, but no continuation 
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from * * (n) can ever use E as an argument and so can ever 
compute E*. Thus there is an interpretation for which 
the two corresponding execution sequences compute differ- 
ent sequences of expressions, and so, by Theorem 7.5, 
P t Q- / 

Let m be the (fixed) number of location symbols 

involved in the schemata under consideration, and let s 

be tne group of all permutations on m elements. If 

# C Spj, we define for any vector of integers in the 
I 

i 



! 1 n } : 



* (<k r —,* t > ) = < * (1^),..., * (k t ) > 
and for a set of such integers: 
*<T) = j*<j) | j C T 

Definition . Address a is * -similarto address b, 
a * < (b) , when : 

(I) if a,b are terminal or equal « (the 'dynamic 
stop 1 ) , then a = b? 
{II} if a,b are prefixed of statements, then: 
(i) fn(a) = fn{b) 
(ii) Ret (a) = * (Ret(b)) 
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(iii) either asg(a) = Mftsg(b)) 

QrTasg(a)£ u * (Live (succ^b) )) 
and v<asg(b))tt u Livo{succ i (a) ) 

(iv) Live(a) = *(Live{b)) 

* a ' h is defined as * . (rs) where r - asg(a) and 

5 e y- (asg(b)) (i.e. the permutation * followed by the 

transposition of r and 5) ■ 

For any configuration a ■ < a, I) > , we define: 

■ 

*[d1 = < D ,.-.,D , > 

L J f 1 (l) * (m) 

If configuration & immediately follows a in an execution 
sequence corresponding to some interpretation I, we write: 

I 



or just <* 



Let a - < a,A >, 8 = < b,B > , y - < c,C > , £ - < d, D > 
denote general reduced configurations. 



Lemma 



8. Suppose A = # S I* '^ e expressions computed at 



a and ft are the same, and for all interpretations I, 

Range { 7 ) = Range ( & ) where a > 7 and fl • 

then: 

a,b 



(1) C ■ * a ' [d! for such 1, 

(2) a ^ (b) 
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Proof. Part (1) is trivial. For part (2), the conditions 
(i) and (iv) of the definition of ' a - * (b) ' are 
clearly satisfied, and since the non-aero components of 
any reduced configuration are always distinct, so also is 
condition (ii) . Suppose that* 

asg(a) f *(asg(b)) and *(asg(b}) £ Live(c) 
for some t = <c,£ > , say, where o — ^"V . Then: 

" c *(asg(b)) sin « only L a sg{a) is 
assigned to. 

" D (* a **)- l (# (asg(b)) ) 
= D ^- 1 (asg(a)> 

" B ^ fcl (asg(a)J 
which is impossible. A similar argument is followed if 
asg(a) C <r(Live(d)), and so condition (iii) is satisfied, 
and part (2) is proved. / 

For any two sequences r t *\ we defined perm(r,n' ,n) by: 
perm(% * f , 0) = £ = identity permutation 
perrM*, ir',n+l) = [penning - #n j] *' n >' r '( n > 

for n > 0. 
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Theorem 7,8 , For any schemata F,Q: P = Q if and only if 
for all interpretations I and integers n: 



, <»(„> .*(»,»« J 



(0) 



where * = perm(* z * r y #n) 

proof . The sufficiency condition follows from Part (1) of 
the definition of * -similarity. The necessity is a con- 
sequence of Theorem 7,5, Lemma 7, and Lemma 8, and is 
proved by induction on n. / 

A triple <a,b,*>is valid if a ^ *{b), and invalid 
otherwise. A triple <c,d,^' > follows a triple 
< a,b, P > if *' «**' , and for some i, c = succ^a) 
and d ■ succ^(b) . A chain of triples is a sequence 
x^x^x^... of triples, where x n+1 follows x fl for all n > 1, 

Theorem 7.8 can be alternatively expressed as: 
1 P = if and only if there is no chain starting 



from <p ,q Q , ( > , where Pq'^q are the initial 
addresses of P,Q respectively, and containing an 
invalid triple. ' 
Since it is effectively determinable whether or not a given 
triple is valid, and there are only a finite number of 
triples involved, we have: 
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Theorem 7.9 . It is recursively solvable, given any two 
progressive schemata P and Q, whether or not P ■ Q. 
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$7 .5 Conservative schemata 

A conservative schema is one in all of whose compu- 
tation instructions the assignment location appears as a 
retrieval location, eg. : 

L 2 :- H(L 3 ,L 2 ,L 2 ) 

Lenroa . Any conservative schema is liberal. 

Proof . The proof follows by induction when we note that: 
(i) the expressions associated with the locations 
are all distinct, 
(ii) whenever any new expression is computed, one of 

its subexpressions is lost by being 'overwritten* , 

so the computation cannot be repeated, / 

If we delete the first instruction: 

from each of the simulation schemata of §5, the resulting 
schemata are conservative. There is an obvious isomorphism 
between conservative schemata with a single monadic func- 
tion symbol F, say, and m locations and m-tape finite 
automata. Given such a schema, we prefix it with the new 
sequence of instructions: 
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L 2 :- F(L L ) 



L m :- PtL^) 



and then we can construct the equivalent free schema, 
as in Theorem 7.2. To each abbreviated statement: 



a . L: 

Q x 



F < L i> a i d ? k 



of this schema, we make correspond the state q in an 
m«-tape automaton with the transitions; 





head 


next state 


state 


a l s 2 * " * S 2* 


*0 


i 


a l a 2 * * ' a 2 k 



The trivial loop at n corresponds to a trivial loop in 
the automaton. The reverse correspondence is obvious. 

Theorem 7 .10 . Two such schemata are equivalent if and 
only if their corresponding m-tape automata are equivalent 
(that is, accept and reject the same sets of tapes) - 



Proof . We just extend the correspondence by noting 
that an m-tape automaton accepts, rejects, diverges on, 
respectively, an m-tuple of tapes, just as the correspond- 
ing schema succeeds, fails or diverges, under the 
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Interpretation which corresponds with the m-tuple in 
the obvious way. Any interpretation corresponds to an 
m-tuple of tapes. / 

The equivalence problem for multitape finite 
automata is well known, but to date is unresolved. 
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SUMMARY 

In 52 and S3 we introduced program schemata, gave 
some justification of the reasons behind our definitions, 
and began to discover the sort of problems we ought to 
investigate. If the formalism is to provide the basis of 
an adequate optimization procedure, it is preferable 
that the equivalence problems involved be at least par- 
tially solvable. In Part II we began to chart the unsolv- 
able domains that would have to be avoided. Some of the 
general conclusions we can draw, as a result of this, are: 

(1) By electing to restrict the range of possible 
interpretations in some general way, we can gain, at most, 
some partial solvability results. 

(2) It avails us nothing in solvability to reduce 
the number of function and test symbols to the minimum 
which leaves a nontrivial language, and unsolvability is 
with us already in schemata with just two locations. 

(3) We do not require arbitrarily complicated loop- 
structures in the flow— diagrams for unsolvability, but 
we can hope to find decision procedures under certain 
nontrivial constraints on the structure. 

In Part in we started to investigate some of the 
possible solvable domains, in $6 for example, we found 
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a decision procedure for a class of schemata suggested 
by (3). While this is encouraging, we shall have to 
extend this result somewhat, before it can have very 
general application, A very noticeable feature of the 
schemata used to prove unsolvability results in Part II 
is the high degree of repetition of computation and test- 
ing of values* This suggested that some attention be 
given to 'free' schemata, which never test the same value 
twice, and to 'liberal 1 schemata, which never compute the 
same value twice* In $7*1 we derived some information 
about the relationship between these different classes, 
but wo required the further restriction to 'progressive 1 
schemata, before a decision procedure emerged {$7*4). 
The final short $7-5 on 'conservative' schemata merely 
serves to introduce a class of schemata, with a potentially 
solvable decision problem, and which is wide enough to 
encompass a usefully large variety of schemata* A solu- 
tion to the equivalence problem for multitape automata 
would be a major step toward a decision procedure for these 
schemata, and we await this solution with interest. 

The need now arises to develop techniques by which 
actual computer programs may best be expressed in the form 
of program schemata, and if possible program schemata in 
a solvable domain* This abstraction may be carried out at 
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several levels in turn. At the 'deepest' level, we may 
express single 'machine-orders', or even parts of such 
orders, as schema statements and consider simplifications 
of the program, a small portion at a time. For 'straight- 
through* sequences of instructions, existing ccanpilers 
use various ad hoc techniques for, say, eliminating redun- 
dant orders or collecting together common subexpressions 
of expressions to be calculated, and it would be valuable 
to have a formalism in which to express these techniques, 
and to develop new ones in a 'machine-independent' form. 
At 'shallower* levels of simplification, we might use one 
schema statement to represent a whole subprogram or routine, 
operating perhaps on complex data structures. Abstraction 
to a schema would allow the convenient manipulation of 
these blocks of instructions and permit simplifications 
of a grosser kind. 

These ideas have provided the motivation for the 
present research, but we do not discuss the practical appli- 
cations in any detail in this dissertation. We have set 
down a theoretical basis of this model of computation, 
investigated some of its immediate ramifications and 
indicated scene of the directions in which we hope to progress 
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